Commit graph

59 commits

Author SHA1 Message Date
xenofem 15410ca8ba cleaner handling of better/worse image quality versions 2024-02-20 17:23:33 -05:00
xenofem 1457168951 prefer versions without moire-inducing tone effects 2024-02-17 02:54:04 -05:00
xenofem fd6e1f0eb4 give more opportunities to compare images/pdf between each stage of splitting 2024-02-16 16:29:55 -05:00
xenofem 25f44473c4 apply unicode normalization while finding prefix orderings, because the world is bad 2024-02-16 16:08:56 -05:00
xenofem 33c6b9fa1f detect when a PDF has images at the same resolution, just chopped up 2024-02-15 19:32:50 -05:00
xenofem 7a96bc5655 expand bonus regex, refactor regex splitting code 2024-02-13 10:45:21 -05:00
xenofem 1adabbe5e6 handle concept art type bonus stuff 2024-02-13 10:29:50 -05:00
xenofem 6bc8332f47 make sure to create site/images if it doesn't exist 2024-02-11 14:38:50 -05:00
xenofem 437cbcd73f add omake to epilogue regex 2024-02-08 05:29:44 -05:00
xenofem 83c5e128d6 recognize back covers as well as front covers 2024-02-08 05:11:17 -05:00
xenofem f5f338f386 add short aliases for subcommands 2024-02-08 05:03:40 -05:00
xenofem d06ea132af include full-width numerals in number regex 2024-02-08 04:54:36 -05:00
xenofem b7f9099fc0 more textless regex 2024-02-08 04:32:05 -05:00
xenofem 6a82009dd0 add .bmp to recognized image formats 2024-02-08 00:14:24 -05:00
xenofem 2ee0c49b3d add more cover regex, make all the regexes case-insensitive 2024-02-08 00:06:30 -05:00
xenofem 33a65e3a2b more cover regex 2024-02-08 00:03:05 -05:00
xenofem 62bd4891f6 be more forgiving about checking index deltas 2024-02-07 23:59:35 -05:00
xenofem de01acabe5 more textless regex 2024-02-07 23:11:37 -05:00
xenofem bfee8affe1 go back to deducing work_id automatically in manual-collate 2024-02-07 22:52:17 -05:00
xenofem bee5f7c58a completely refactor manual collation to be more ergonomic 2024-02-07 22:32:31 -05:00
xenofem 9353357dc9 for alphabet ordering, blank is 0, a is 1 2024-02-07 20:45:46 -05:00
xenofem 2d1add4f3e need to actually return values, who would've thought 2024-02-07 19:51:46 -05:00
xenofem 9c6328659f refactor checking file extensions 2024-02-07 19:18:19 -05:00
xenofem c24c811115 handle when the pdf is in a folder called PDF instead of being right there in the directory 2024-02-07 19:11:37 -05:00
xenofem 9ff18f933b detect cover images and hi-res editions 2024-02-07 17:42:18 -05:00
xenofem 51243aca6e add more textless regex, fix bug in alphabetized uniqueness checker 2024-02-07 17:12:02 -05:00
xenofem 7ab32041c8 import readline so hopefully input prompts will suck less 2024-02-07 00:24:30 -05:00
xenofem cf66c7db0a make author relations and tag relations unique in db schema 2024-02-07 00:09:42 -05:00
xenofem c2f516a281 add subcommand to show page counts and image sizes of an extracted work's files 2024-02-06 23:52:59 -05:00
xenofem cdf06d9aa0 ignore the mp4 files that fanza throws in sometimes 2024-02-06 23:02:59 -05:00
xenofem 7e429f3160 detect if there are equivalent PDF and image-file versions, and choose whichever is higher-resolution 2024-02-06 23:01:59 -05:00
xenofem 7a63c27702 more alt versions 2024-02-06 13:59:49 -05:00
xenofem 2a70e363cd more alt versions 2024-02-06 13:29:38 -05:00
xenofem 26af5286dc handle combinations of versions, and cases where things have versions but not numbers 2024-02-06 13:20:56 -05:00
xenofem 85ac6bc35c handle cases where the first item isn't numbered 2024-02-06 13:00:44 -05:00
xenofem 61a7eb07f9 detect epilogues in auto collation 2024-02-06 12:48:00 -05:00
xenofem ef0c115f0f add more alt versions 2024-02-06 12:30:13 -05:00
xenofem 657ec65e4a handle alphabetic suffixes 2024-02-06 12:19:11 -05:00
xenofem 330b10c85b more flexible splitting out of textless pages 2024-02-06 11:59:20 -05:00
xenofem aefaf824a8 handle hierarchical numbering schemes in auto-collation 2024-02-06 11:02:08 -05:00
xenofem d66b467c5c stop automatic collation if the sequence of numbers doesn't seem to be indices in an ordering 2024-02-06 10:22:48 -05:00
xenofem cb1a1488e2 smarter automatic collation when there are multiple versions of pages 2024-02-06 10:22:11 -05:00
xenofem aa039e60f5 ignore photoshop documents 2024-02-06 09:34:21 -05:00
xenofem 83d99f4585 smarter automatic collation of textless versions 2024-02-06 09:26:39 -05:00
xenofem 25374a5ba3 default to using first page as thumbnail 2024-02-06 08:29:52 -05:00
xenofem d351da79c1 give more progress info during extract 2024-02-04 00:49:26 -05:00
xenofem 3efee8fe05 support Fakku works using first page as thumbnail for now 2024-01-29 21:25:21 -05:00
xenofem a76e3eec83 support fanza works with manual metadata input 2024-01-29 04:11:55 -05:00
xenofem 309f19574d don't try to fetch dlsite metadata for non-dlsite works 2024-01-29 03:01:35 -05:00
xenofem b75bad995a rework front page to have shuffle and lazy infinite scroll 2024-01-25 04:10:17 -05:00