Commit graph

166 commits

Author SHA1 Message Date
xenofem 11ea5a0d58 more useful explanations of PDF failures 2024-03-11 05:47:47 -04:00
xenofem 6f4444f67e misc corrections/optimizations in top() and similarity() 2024-03-03 11:42:21 -05:00
xenofem 42b49c7ecc prioritize matches with more consecutive characters 2024-03-03 03:21:53 -05:00
xenofem f994060149 suggest works with similar titles that aren't already in the same series 2024-03-03 02:56:46 -05:00
xenofem e8c553e5d8 add suggested works (next step: make useful suggestions) 2024-03-03 02:14:56 -05:00
xenofem f94f23186b convert extra-weird PDF image formats like JBIG2 to PNG 2024-03-03 01:47:29 -05:00
xenofem 0d1bff74c2 small fixes to alphabetic_numbering detection 2024-03-03 01:08:57 -05:00
xenofem 7535cb6162 also check whether PDFs have text alongside images 2024-03-02 18:27:15 -05:00
xenofem c042163e85 properly handle edge case when we point collate or manual-collate directly at an extraction directory 2024-03-02 18:10:22 -05:00
xenofem 9fea03c270 add option to convert PDF pages to pixmaps as needed 2024-03-02 18:09:46 -05:00
xenofem 2db4553570 search bar 2024-03-02 00:50:28 -05:00
xenofem be99dc5578 misc code cleanup 2024-03-02 00:15:18 -05:00
xenofem 65017abe00 filter by language preference when collating 2024-03-02 00:13:30 -05:00
xenofem 0be720599d refactor collation code 2024-03-01 23:43:38 -05:00
xenofem 3ed462972a more regexes, shorten regex flags 2024-02-22 19:24:01 -05:00
xenofem 15410ca8ba cleaner handling of better/worse image quality versions 2024-02-20 17:23:33 -05:00
xenofem 1457168951 prefer versions without moire-inducing tone effects 2024-02-17 02:54:04 -05:00
xenofem fd6e1f0eb4 give more opportunities to compare images/pdf between each stage of splitting 2024-02-16 16:29:55 -05:00
xenofem 25f44473c4 apply unicode normalization while finding prefix orderings, because the world is bad 2024-02-16 16:08:56 -05:00
xenofem 34c0435d05 use packages.<system>.default instead of defaultPackage in flake 2024-02-15 19:35:35 -05:00
xenofem 33c6b9fa1f detect when a PDF has images at the same resolution, just chopped up 2024-02-15 19:32:50 -05:00
xenofem 7a96bc5655 expand bonus regex, refactor regex splitting code 2024-02-13 10:45:21 -05:00
xenofem 1adabbe5e6 handle concept art type bonus stuff 2024-02-13 10:29:50 -05:00
xenofem c25e539a0f smoother progress bar with requestAnimationFrame 2024-02-11 17:55:00 -05:00
xenofem 89093ac490 smarter image preloading 2024-02-11 17:13:10 -05:00
xenofem 6bc8332f47 make sure to create site/images if it doesn't exist 2024-02-11 14:38:50 -05:00
xenofem 437cbcd73f add omake to epilogue regex 2024-02-08 05:29:44 -05:00
xenofem 83c5e128d6 recognize back covers as well as front covers 2024-02-08 05:11:17 -05:00
xenofem f5f338f386 add short aliases for subcommands 2024-02-08 05:03:40 -05:00
xenofem d06ea132af include full-width numerals in number regex 2024-02-08 04:54:36 -05:00
xenofem b7f9099fc0 more textless regex 2024-02-08 04:32:05 -05:00
xenofem 0955ec1c90 tweak flexbox stuff so work preview and work info don't get huge gap between them 2024-02-08 03:39:00 -05:00
xenofem 6a82009dd0 add .bmp to recognized image formats 2024-02-08 00:14:24 -05:00
xenofem 2ee0c49b3d add more cover regex, make all the regexes case-insensitive 2024-02-08 00:06:30 -05:00
xenofem 33a65e3a2b more cover regex 2024-02-08 00:03:05 -05:00
xenofem 62bd4891f6 be more forgiving about checking index deltas 2024-02-07 23:59:35 -05:00
xenofem de01acabe5 more textless regex 2024-02-07 23:11:37 -05:00
xenofem bfee8affe1 go back to deducing work_id automatically in manual-collate 2024-02-07 22:52:17 -05:00
xenofem ad71c6c280 version 0.2 2024-02-07 22:33:41 -05:00
xenofem bee5f7c58a completely refactor manual collation to be more ergonomic 2024-02-07 22:32:31 -05:00
xenofem 9353357dc9 for alphabet ordering, blank is 0, a is 1 2024-02-07 20:45:46 -05:00
xenofem 2d1add4f3e need to actually return values, who would've thought 2024-02-07 19:51:46 -05:00
xenofem 9c6328659f refactor checking file extensions 2024-02-07 19:18:19 -05:00
xenofem c24c811115 handle when the pdf is in a folder called PDF instead of being right there in the directory 2024-02-07 19:11:37 -05:00
xenofem 9ff18f933b detect cover images and hi-res editions 2024-02-07 17:42:18 -05:00
xenofem 51243aca6e add more textless regex, fix bug in alphabetized uniqueness checker 2024-02-07 17:12:02 -05:00
xenofem 7ab32041c8 import readline so hopefully input prompts will suck less 2024-02-07 00:24:30 -05:00
xenofem 885b4884fd don't write description of 'None' if the field is empty 2024-02-07 00:14:22 -05:00
xenofem cf66c7db0a make author relations and tag relations unique in db schema 2024-02-07 00:09:42 -05:00
xenofem c2f516a281 add subcommand to show page counts and image sizes of an extracted work's files 2024-02-06 23:52:59 -05:00