this post was submitted on 15 Oct 2024
35 points (100.0% liked)

Selfhosted

I'm using Calibre and Audiobookshelf. I'd love a solution that lets me search the actual contents of my books, for example looking up a topic across every book in my library.

It would be a cool AI feature, similar to how Immich works.

Does anyone have a solution for that?

all 16 comments
[–] [email protected] 14 points 1 month ago

Calibre can build a full-text index and search through everything (well, only for files that actually contain text, and the index needs a lot of disk space).
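
To illustrate what a full-text index buys you, here is a minimal sketch of an inverted index in plain Python. It is not how Calibre implements its index; the sample library, `build_index`, and `search` are made-up names for illustration, and it assumes the text has already been extracted from each book.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_index(books):
    """Map each word to the set of book titles that contain it."""
    index = defaultdict(set)
    for title, text in books.items():
        for word in tokenize(text):
            index[word].add(title)
    return index

def search(index, query):
    """Return the titles that contain every word of the query (AND search)."""
    words = tokenize(query)
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result

# Hypothetical sample library: title -> extracted plain text.
books = {
    "Gardening Basics": "Tomatoes need full sun and regular watering.",
    "Home Networking": "A reverse proxy forwards requests to internal services.",
    "Kitchen Notes": "Roast the tomatoes before adding them to the sauce.",
}
index = build_index(books)
print(sorted(search(index, "tomatoes")))
```

The index stores every distinct word of every book, which is why it takes a lot of space on a large library, and why it only works for files that contain real text rather than scanned images.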

[–] [email protected] 8 points 1 month ago (3 children)

I had the same idea a while back and was wondering why no one has implemented something like this yet. This seems like an actually useful application for LLMs.

I am using Zotero (citation management software) to collect the scientific articles I have read. Sometimes I forget which article I read something specific in. A search where you describe what you are looking for in a sentence, and which then returns the article along with the relevant part, would be a gamechanger.

[–] [email protected] 5 points 1 month ago (3 children)

What you are looking for is RAG (retrieval-augmented generation), which is one of the few legitimately useful applications of LLMs outside the wall of hype.
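
For anyone wondering what that looks like in practice, here is a rough sketch of the retrieval half of RAG. Big assumption: a real setup would use an embedding model and then send the prompt to an LLM, while this sketch swaps in plain word-count vectors with cosine similarity and stops at building the prompt. The passages, `retrieve`, and `build_prompt` are hypothetical.

```python
import math
import re
from collections import Counter

def vectorize(text):
    """Turn text into a bag-of-words vector (a stand-in for a real embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))

def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items() if word in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def retrieve(passages, question, k=2):
    """Return the k passages most similar to the question."""
    query = vectorize(question)
    ranked = sorted(
        passages,
        key=lambda p: cosine(vectorize(p["text"]), query),
        reverse=True,
    )
    return ranked[:k]

def build_prompt(passages, question):
    """Assemble the text that would be handed to an LLM (no model is called here)."""
    context = "\n".join(f"[{p['source']}] {p['text']}" for p in passages)
    return f"Answer using only these excerpts:\n{context}\n\nQuestion: {question}"

# Hypothetical excerpts; in practice these would be chunks of your own articles.
passages = [
    {"source": "Article A", "text": "Soil bacteria fix nitrogen in legume root nodules."},
    {"source": "Article B", "text": "Container networking relies on virtual bridges."},
    {"source": "Article C", "text": "Nitrogen fertilizer runoff pollutes rivers and lakes."},
]
question = "Which article discussed bacteria that fix nitrogen?"
top = retrieve(passages, question, k=2)
print(build_prompt(top, question))
```

Because every retrieved passage carries its source, the answer can point back to the article it came from, which is the "which article did I read that in" use case.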

[–] [email protected] 3 points 1 month ago

Thanks for the link! Learned something new today.

[–] [email protected] 1 points 1 month ago

nice, that is exactly what I'm looking for - thanks : )

[–] [email protected] 1 points 1 month ago

I've had the idea for a while to use an LLM to gather metadata about books for me, as well as to generate tag lists for themes, plot, writing style, etc. for everything in my ebook library. You could also generate non-spoiler plot summaries and produce recommendations for similar books.

[–] [email protected] 1 points 1 month ago

A search where you describe what you are looking for in a sentence, and which then returns the article along with the relevant part, would be a gamechanger.

Yeah, exactly that

[–] [email protected] 5 points 1 month ago

Paperless-ngx will store PDFs and index their contents for searching. It's not necessarily meant for books, but I think it would work.

[–] [email protected] 2 points 1 month ago (2 children)

Keep the actual EPUBs and search in those? (You basically want a transcript of a read-out book... which is the book itself.)
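
A rough sketch of what searching the files directly could look like, relying on the fact that an EPUB is a ZIP archive of (X)HTML files. `epub_text`, `books_mentioning`, and the tiny in-memory "books" are made up for illustration; real EPUBs may also need handling for DRM, for reading order, and for text inside style or script elements, all of which this ignores.

```python
import io
import zipfile
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collect the text nodes of an (X)HTML document, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.parts.append(text)

def epub_text(epub_file):
    """Return the text of every (X)HTML file inside an EPUB as one string.

    epub_file may be a path or a binary file-like object. Files are read
    in name order, which is not necessarily the book's reading order.
    """
    parts = []
    with zipfile.ZipFile(epub_file) as archive:
        for name in sorted(archive.namelist()):
            if name.lower().endswith((".xhtml", ".html", ".htm")):
                extractor = TextExtractor()
                extractor.feed(archive.read(name).decode("utf-8", errors="ignore"))
                extractor.close()
                parts.extend(extractor.parts)
    return " ".join(parts)

def books_mentioning(library, phrase):
    """Return the names of the books whose text contains the phrase."""
    needle = phrase.lower()
    return [name for name, book in library.items()
            if needle in epub_text(book).lower()]

def make_epub(chapter_html):
    """Build a tiny in-memory stand-in for an EPUB (hypothetical test data)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("OEBPS/chapter1.xhtml", chapter_html)
    return buffer

library = {
    "book-a.epub": make_epub("<html><body><p>Sourdough needs a starter.</p></body></html>"),
    "book-b.epub": make_epub("<html><body><p>Backups need testing.</p></body></html>"),
}
print(books_mentioning(library, "sourdough"))
```

This re-reads every book on every search, so it only scales to a small library; an index built once is the usual fix.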

[–] [email protected] 6 points 1 month ago (1 children)

He wants all of his books in one index.

[–] [email protected] 2 points 1 month ago (1 children)

And I think they want a solution that'll index audiobooks, too: an LLM that'll listen to, transcribe, and index audiobooks.

[–] [email protected] 2 points 1 month ago (1 children)

Audiobooks have a simple workaround: if you can find a text version of the book to download, just index that.

[–] [email protected] 1 points 1 month ago

Sure. I'm just saying, I think OP is looking for something that doesn't require either buying the book again or pirating it.

[–] [email protected] 1 points 1 month ago

Yeah, I'm only talking about ebooks. I just mentioned Audiobookshelf because it can also handle ebooks, and I've read here that people use it as an ebook management tool.