This is FURRY-CENTRIC site rip originated from e621:net imageboard grabbed mostly via tbib:org crossposting:
This rips is not intended to be “complete and maximum quality” but rather "representative the best of"
to help anybody to open the furry world while not bumping into yiff (furry hentai, often male/male) and comix stockpiles
Another reason is neural network training over art images.
There are promising results for specie-specific head classes (dragonhead, ponyhead, Judy Hopps, Nick Wilde, …), stay tuned.
Manually:
Also a lot of handjob manual filtering done to avoid obviously unsafe art and throttle most of furry fetishes.
Despite furry is not SFW by definition, (almost) no frontal nudity and evident adult activity left here so R14+ seems applicable.
exiftool -filecreatedate -imagesize -filesize# -filetype -JPEGQualityEstimate -csv -r B:\TBIB\ > exif.txt
select 'magick convert "'||sourcefile||'" '||
case when iw/ih between 0.8 and 1.2 and px>4000000 then '-resize 1920x1920^>'
when iw/ih<0.8 and px>5000000 then '-resize 2480x2480^>'
when iw/ih>1.2 and px>6000000 then '-resize 2560x2560^>'
else to_char(null) end||' '||
case when jq>=98 then '-quality 94' else to_char(null) end||' '||
case when filesize/(iw*ih)>0.7 then '-blur 4' else to_char(null) end||
' "'||replace(sourcefile,'\tbib\','\tbic\')||'"' mm
from exif e
where ( jq between 98 and 100
or (iw/ih between 0.8 and 1.2 and px>4000000)
or (iw/ih<0.8 and px>5000000)
or (iw/ih>1.2 and px>6000000) )
and ((filesize>1600000 and jq>84) or filesize>4000000 or (filesize/(iw*ih)>0.7) )
order by fpath desc, fname
.
.
FID - imageboard post ID (e621 when < 1000000, tbib when >= 5000000)
for torrent content
FPATH - folder / zip name
FNAME - file name
TORR_FSIZE - file size, bytes
TORR_ISIZE - image size WxH
TORR_JQ - JPEG quality
TORR_MD5 - checksum
imageboard originated if available
ORIG_DT - posting date
ORIG_RATE - Safe / Questionable
ORIG_ISIZE - WxH
ORIG_EXT - image type (extension)
ORIG_MD5 - checksum
imagemagick:org calculated
TENTR - enthropy (complexity)
TSKEW - skewness (black/white balance)
TSTDDEV - (black/white contrast)
TCOLORS - count of colors
keras-craft text detector calculated
TXSIZE - total text area
TXCNT - number of text pieces
.
.
FID - imageboard post ID
TAG - string tag
TAG_CAT - tag category COPYRIGHT / CHARACTER / SPECIE / ARTIST / GENERAL or UNKNOWN
Comments - 0