I’m looking for a self hosted solution to this problem:

I want to create a full text search index from a collection of PDF manuals (text, not images, I don’t care about OCR here). There is a UI to search for text matches in documents, and clicking a search hit opens the PDF scrolled to where the search hit is (bonus points if the search hit is hilighted)

  • Illecors@lemmy.cafe
    link
    fedilink
    arrow-up
    9
    ·
    1 year ago

    A very crude solution would be to merge all PDFs into a single file and just use the search of your favourite PDF viewer.