Furry (the best of) E621 via TBIB almost SFW sampled with metadata

Date: 2022-10-01 12:21
Seeders: 3
Leechers: 0
Completed: 49
File size: 162.1 GiB
Information: No information.
Info hash: 12300d111410f19f3ede0ffb9bf2adaf27dbda7e

This is a FURRY-CENTRIC site rip originating from the e621:net imageboard, grabbed mostly via tbib:org crossposting:

  • TBIB for the interval 04.2016-07.2022, post IDs 5.000.000 - 11.000.000
  • E621: only the topmost posts up to 07.2016, post IDs 10.000 - 999.999

This rip is not intended to be “complete and maximum quality” but rather a representative “best of”,
to help anybody discover the furry world without bumping into yiff (furry hentai, often male/male) and comic stockpiles.

Another reason is neural network training on art images.
There are promising results for species-specific head classes (dragonhead, ponyhead, Judy Hopps, Nick Wilde, …); stay tuned.

Manually:

  • comics and 4koma, most line art, segmented scans and covers overlaid with text were filtered out
  • crops were made where the background was large and simple or dirty; occasionally gamma correction and other nontrivial improvements were applied

A lot of manual filtering by hand was also done to avoid obviously unsafe art and to throttle most furry fetishes.
Although furry is not SFW by definition, (almost) no frontal nudity or evident adult activity is left here, so an R14+ rating seems applicable.

This release contains:

  • 279.736 JPG images
    • renamed to contain ID - up_to_3_copyrights ~ up_to_5_species_or_characters (up_to_2_artists)
    • PNG >> JPG converted (94% quality), some of them “sampled” to a reasonable size / volume
    • deduplicated using AntiDupl (up to 4% similarity)
    • split / zipped into folders by ID range, with Questionable and eXtra separated (use MaxView or unzip to browse)
  • additional TSV (tab-separated text) metadata
    • key parameters for every image (from the imageboard and as released), spreadsheet-friendly
    • tag-to-image relations - 8.167.078 rows; some tool is needed to work with them
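The renaming scheme above can be taken apart again with a small script. This is only a sketch: it assumes the separators are literally " - ", " ~ " and a trailing " (artists)" group exactly as written in the pattern, and that all three groups are present; `parse_name` and the sample file name are made up for illustration. How multiple copyrights, species or artists are delimited inside each group is not stated here, so each group is returned as a raw string.

```python
import re
from typing import Dict, Optional

# Assumed layout: "<ID> - <copyrights> ~ <species_or_characters> (<artists>)"
NAME_RE = re.compile(
    r"(?P<fid>\d+) - (?P<copyrights>.*?) ~ (?P<subjects>.*) \((?P<artists>[^()]*)\)"
)

def parse_name(filename: str) -> Optional[Dict[str, str]]:
    """Split a released file name into its parts; None if it does not fit the assumed layout."""
    stem = filename
    if filename.lower().endswith((".jpg", ".jpeg")):
        stem = filename[: filename.rfind(".")]  # drop the extension only
    match = NAME_RE.fullmatch(stem)
    return match.groupdict() if match else None

if __name__ == "__main__":
    # Hypothetical file name, not taken from the release
    print(parse_name("5123456 - zootopia ~ judy_hopps fox (some_artist).jpg"))
```

Names that do not fit the assumed layout simply return None, so they can be listed and checked by hand.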

More about sampling

  1. detected image properties
exiftool -filecreatedate -imagesize -filesize# -filetype -JPEGQualityEstimate -csv -r B:\TBIB\ > exif.txt
  2. used them in a query that generates the ImageMagick commands
-- assumed meaning of the derived columns: iw / ih = image width / height,
-- px = pixel count, jq = JPEG quality estimate
select 'magick convert "'||sourcefile||'" '||
  -- downscale oversized images only; the limit depends on the aspect ratio
  case when iw/ih between 0.8 and 1.2 and px>4000000 then '-resize 1920x1920^>'
       when iw/ih<0.8                 and px>5000000 then '-resize 2480x2480^>'
       when iw/ih>1.2                 and px>6000000 then '-resize 2560x2560^>'
       else to_char(null) end||' '||
  -- recompress near-lossless JPEGs
  case when jq>=98 then '-quality 94' else to_char(null) end||' '||
  -- blur images with a high bytes-per-pixel ratio
  case when filesize/(iw*ih)>0.7 then '-blur 4' else to_char(null) end||
  ' "'||replace(sourcefile,'\tbib\','\tbic\')||'"' mm
from exif e
where ( jq between 98 and 100
     or (iw/ih between 0.8 and 1.2 and px>4000000)
     or (iw/ih<0.8                 and px>5000000)
     or (iw/ih>1.2                 and px>6000000) )
  and ((filesize>1600000 and jq>84) or filesize>4000000 or (filesize/(iw*ih)>0.7) )
order by fpath desc, fname
  3. images were left untouched when sampling had minimal or negative effect
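For those who prefer a script over SQL, the selection rules of the query above can be sketched in Python. This is not the tool used for the release, just a restatement of the same thresholds. Assumptions: `px` is taken as width times height, and the `^` in `^>` is treated as the Windows command shell escape, so the geometry passed to ImageMagick is `1920x1920>` (shrink only). The function names are made up.

```python
from typing import List, Optional

def sampling_args(iw: int, ih: int, filesize: int, jq: int) -> Optional[List[str]]:
    """Return ImageMagick options for one image, or None to leave it untouched."""
    px = iw * ih                      # assumed: pixel count = width * height
    ratio = iw / ih
    bytes_per_px = filesize / px

    squarish = 0.8 <= ratio <= 1.2 and px > 4_000_000
    portrait = ratio < 0.8 and px > 5_000_000
    landscape = ratio > 1.2 and px > 6_000_000

    # Same two-part filter as the WHERE clause of the query
    candidate = 98 <= jq <= 100 or squarish or portrait or landscape
    heavy = (filesize > 1_600_000 and jq > 84) or filesize > 4_000_000 or bytes_per_px > 0.7
    if not (candidate and heavy):
        return None

    args: List[str] = []
    if squarish:
        args += ["-resize", "1920x1920>"]   # ">" = only shrink larger images
    elif portrait:
        args += ["-resize", "2480x2480>"]
    elif landscape:
        args += ["-resize", "2560x2560>"]
    if jq >= 98:
        args += ["-quality", "94"]
    if bytes_per_px > 0.7:
        args += ["-blur", "4"]
    return args

def magick_command(src: str, dst: str, args: List[str]) -> List[str]:
    """Build an argument list; passing a list to subprocess.run avoids shell escaping."""
    return ["magick", "convert", src, *args, dst]

if __name__ == "__main__":
    opts = sampling_args(iw=3000, ih=3000, filesize=5_000_000, jq=90)
    if opts is not None:
        print(magick_command(r"B:\tbib\in.jpg", r"B:\tbic\in.jpg", opts))
```

The result of step 3 (keeping the original when the sampled file is not better) is not covered by this sketch; comparing the two file sizes after conversion would be one way to do it.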

More about metadata

TBIB_E621_2022.tsv

FID - imageboard post ID (e621 when < 1000000, tbib when >= 5000000)
for torrent content:
FPATH - folder / zip name
FNAME - file name
TORR_FSIZE - file size, bytes
TORR_ISIZE - image size WxH
TORR_JQ - JPEG quality
TORR_MD5 - checksum
imageboard originated, if available:
ORIG_DT - posting date
ORIG_RATE - Safe / Questionable
ORIG_ISIZE - image size WxH
ORIG_EXT - image type (extension)
ORIG_MD5 - checksum
imagemagick:org calculated:
TENTR - entropy (complexity)
TSKEW - skewness (black/white balance)
TSTDDEV - standard deviation (black/white contrast)
TCOLORS - count of colors
keras-craft text detector calculated:
TXSIZE - total text area
TXCNT - number of text pieces

TBIB_E621_2022_TAGS.tsv

FID - imageboard post ID
TAG - string tag
TAG_CAT - tag category COPYRIGHT / CHARACTER / SPECIE / ARTIST / GENERAL or UNKNOWN
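As one possible "tool" for the tag table, a few lines of Python with the standard csv module are enough. This is a sketch under assumptions: both TSV files are assumed to start with a header row using the column names listed above and to be UTF-8 encoded; `source_of` and `fids_with_tag` are made-up helper names.

```python
import csv
from typing import Iterable, Set

def source_of(fid: int) -> str:
    """Map a post ID to its imageboard, following the FID ranges described above."""
    if fid < 1_000_000:
        return "e621"
    if fid >= 5_000_000:
        return "tbib"
    return "unknown"

def fids_with_tag(lines: Iterable[str], tag: str) -> Set[int]:
    """Stream the tag table and collect the post IDs carrying the given tag."""
    # Assumed: first line is a header with FID / TAG / TAG_CAT columns
    reader = csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    return {int(row["FID"]) for row in reader if row["TAG"] == tag}

if __name__ == "__main__":
    # Streams the 8.167.078 rows without loading them all into memory
    with open("TBIB_E621_2022_TAGS.tsv", newline="", encoding="utf-8") as f:
        fids = fids_with_tag(f, "dragon")
    print(len(fids), "images;", sum(source_of(i) == "e621" for i in fids), "from e621")
```

The resulting set of FIDs can then be matched against the FID column of TBIB_E621_2022.tsv to get FPATH and FNAME for each image.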

Aogami_racing

File list

  • TBIB_E621_2022
    • 01eeeeee.q.zip (906.1 MiB)
    • 01eeeeee.zip (577.4 MiB)
    • 050xxxxx.zip (579.9 MiB)
    • 051xxxxx.zip (1018.6 MiB)
    • 052xxxxx.zip (1.3 GiB)
    • 053xxxxx.zip (726.6 MiB)
    • 054xxxxx.zip (1.6 GiB)
    • 055xxxxx.zip (1.5 GiB)
    • 056xxxxx.zip (1.9 GiB)
    • 057xxxxx.zip (1.3 GiB)
    • 058xxxxx.zip (1.3 GiB)
    • 059xxxxx.zip (1.6 GiB)
    • 060xxxxx.q.zip (557.1 MiB)
    • 060xxxxx.zip (1.6 GiB)
    • 061xxxxx.q.zip (539.2 MiB)
    • 061xxxxx.zip (1.4 GiB)
    • 062xxxxx.q.zip (558.0 MiB)
    • 062xxxxx.zip (1.6 GiB)
    • 063xxxxx.q.zip (523.5 MiB)
    • 063xxxxx.zip (1.6 GiB)
    • 064xxxxx.q.zip (621.0 MiB)
    • 064xxxxx.zip (1.6 GiB)
    • 065xxxxx.q.zip (665.7 MiB)
    • 065xxxxx.zip (1.8 GiB)
    • 066xxxxx.q.zip (718.0 MiB)
    • 066xxxxx.zip (1.8 GiB)
    • 067xxxxx.q.zip (747.4 MiB)
    • 067xxxxx.zip (1.8 GiB)
    • 068xxxxx.q.zip (690.6 MiB)
    • 068xxxxx.zip (1.5 GiB)
    • 069xxxxx.q.zip (649.1 MiB)
    • 069xxxxx.zip (1.6 GiB)
    • 070xxxxx.q.zip (710.4 MiB)
    • 070xxxxx.zip (1.6 GiB)
    • 071xxxxx.q.zip (657.0 MiB)
    • 071xxxxx.zip (1.6 GiB)
    • 072xxxxx.q.zip (648.3 MiB)
    • 072xxxxx.zip (1.6 GiB)
    • 073xxxxx.q.zip (722.4 MiB)
    • 073xxxxx.zip (1.6 GiB)
    • 074xxxxx.q.zip (688.1 MiB)
    • 074xxxxx.zip (1.5 GiB)
    • 075xxxxx.q.zip (735.5 MiB)
    • 075xxxxx.zip (1.5 GiB)
    • 076xxxxx.q.zip (971.0 MiB)
    • 076xxxxx.zip (2.2 GiB)
    • 077xxxxx.q.zip (1.1 GiB)
    • 077xxxxx.zip (2.3 GiB)
    • 078xxxxx.q.zip (1012.9 MiB)
    • 078xxxxx.zip (2.1 GiB)
    • 079xxxxx.q.zip (870.0 MiB)
    • 079xxxxx.zip (2.6 GiB)
    • 080xxxxx.q.zip (739.7 MiB)
    • 080xxxxx.zip (2.3 GiB)
    • 081xxxxx.q.zip (747.0 MiB)
    • 081xxxxx.zip (2.0 GiB)
    • 082xxxxx.q.zip (733.5 MiB)
    • 082xxxxx.zip (2.0 GiB)
    • 083xxxxx.q.zip (755.2 MiB)
    • 083xxxxx.zip (2.2 GiB)
    • 084xxxxx.q.zip (669.9 MiB)
    • 084xxxxx.zip (1.9 GiB)
    • 085xxxxx.q.zip (660.5 MiB)
    • 085xxxxx.zip (1.8 GiB)
    • 086xxxxx.q.zip (793.9 MiB)
    • 086xxxxx.zip (1.8 GiB)
    • 087xxxxx.q.zip (678.2 MiB)
    • 087xxxxx.zip (1.7 GiB)
    • 088xxxxx.q.zip (697.3 MiB)
    • 088xxxxx.zip (1.8 GiB)
    • 089xxxxx.q.zip (702.7 MiB)
    • 089xxxxx.zip (2.1 GiB)
    • 090xxxxx.q.zip (1.3 GiB)
    • 090xxxxx.zip (2.4 GiB)
    • 091xxxxx.q.zip (1.1 GiB)
    • 091xxxxx.zip (1.9 GiB)
    • 092xxxxx.q.zip (1.2 GiB)
    • 092xxxxx.zip (2.0 GiB)
    • 093xxxxx.q.zip (1.5 GiB)
    • 093xxxxx.zip (2.5 GiB)
    • 094xxxxx.q.zip (1.4 GiB)
    • 094xxxxx.zip (2.2 GiB)
    • 095xxxxx.q.zip (618.2 MiB)
    • 095xxxxx.zip (1.1 GiB)
    • 096xxxxx.q.zip (438.7 MiB)
    • 096xxxxx.zip (766.9 MiB)
    • 098xxxxx.q.zip (2.1 GiB)
    • 098xxxxx.zip (3.2 GiB)
    • 100xxxxx.Q.zip (2.0 GiB)
    • 100xxxxx.X.zip (1.3 GiB)
    • 100xxxxx.zip (5.4 GiB)
    • 101xxxxx.Q.zip (1.4 GiB)
    • 101xxxxx.X.zip (761.0 MiB)
    • 101xxxxx.zip (3.1 GiB)
    • 102xxxxx.Q.zip (1.1 GiB)
    • 102xxxxx.X.zip (592.2 MiB)
    • 102xxxxx.zip (2.2 GiB)
    • 103xxxxx.Q.zip (1.2 GiB)
    • 103xxxxx.X.zip (627.9 MiB)
    • 103xxxxx.zip (2.2 GiB)
    • 104xxxxx.Q.zip (1.2 GiB)
    • 104xxxxx.X.zip (517.6 MiB)
    • 104xxxxx.zip (2.3 GiB)
    • 105xxxxx.Q.zip (1.1 GiB)
    • 105xxxxx.X.zip (559.5 MiB)
    • 105xxxxx.zip (2.2 GiB)
    • 106xxxxx.Q.zip (1.2 GiB)
    • 106xxxxx.X.zip (607.5 MiB)
    • 106xxxxx.zip (2.3 GiB)
    • 107xxxxx.Q.zip (1.1 GiB)
    • 107xxxxx.X.zip (558.6 MiB)
    • 107xxxxx.zip (2.2 GiB)
    • 108xxxxx.Q.zip (1.3 GiB)
    • 108xxxxx.X.zip (629.9 MiB)
    • 108xxxxx.zip (2.3 GiB)
    • 109xxxxx.Q.zip (1.3 GiB)
    • 109xxxxx.X.zip (634.2 MiB)
    • 109xxxxx.zip (2.6 GiB)
    • TBIB_E621_2022.tsv (72.9 MiB)
    • TBIB_E621_2022_TAGS.tsv (210.3 MiB)