6b3982ecf0
don't get confused by irrelevant bits of text inserted by PDF generation tools
11ea5a0d58
more useful explanations of PDF failures
42b49c7ecc
prioritize matches with more consecutive characters
f994060149
suggest works with similar titles that aren't already in the same series
e8c553e5d8
add suggested works (next step: make useful suggestions)
7535cb6162
also check whether PDFs have text alongside images
c042163e85
properly handle edge case when we point collate or manual-collate directly at an extraction directory
9fea03c270
add option to convert PDF pages to pixmaps as needed
be99dc5578
misc code cleanup
65017abe00
filter by language preference when collating
0be720599d
refactor collation code
3ed462972a
more regexes, shorten regex flags