i wish there were an LLM hooked up to scihub
Actually Useful AI
Welcome! 🤖
Our community focuses on programming-oriented, hype-free discussion of Artificial Intelligence (AI) topics. We aim to curate content that truly contributes to the understanding and practical application of AI, making it, as the name suggests, "actually useful" for developers and enthusiasts alike.
Be an active member! 🔔
We highly value participation in our community. Whether it's asking questions, sharing insights, or sparking new discussions, your engagement helps us all grow.
What can I post? 📝
In general, anything related to AI is acceptable. However, we encourage you to strive for high-quality content.
What is not allowed? 🚫
- 🔊 Sensationalism: "How I made $1000 in 30 minutes using ChatGPT - the answer will surprise you!"
- ♻️ Recycled Content: "Ultimate ChatGPT Prompting Guide" that is the 10,000th variation on "As a (role), explain (thing) in (style)"
- 🚮 Blogspam: Anything the mods consider crypto/AI bro success porn sigma grindset blogspam
General Rules 📜
Members are expected to engage in on-topic discussions, and exhibit mature, respectful behavior. Those who fail to uphold these standards may find their posts or comments removed, with repeat offenders potentially facing a permanent ban.
While we appreciate focus, a little humor and off-topic banter, when tasteful and relevant, can also add flavor to our discussions.
Related Communities 🌐
General
- [email protected]
- [email protected]
- [email protected]
- [email protected]
- [email protected]
- [email protected]
Chat
Image
Open Source
Please message @[email protected] if you would like us to add a community to this list.
Icon base by Lord Berandas under CC BY 3.0 with modifications to add a gradient
Yeah that would be awesome, I couldn’t get this one to search scihub but the Academic mode does search more reputable sources
I'm building a custom search AI as well, but it would be pretty big to host a scihub search LLM.
It says there are 88,343,822 articles. For an AI to work effectively, you'll have to slice up the articles into paragraphs, so you will probably end up with between 10x to 100x slices. For those slices you'd have to get the embed vector and store it in a Vector database.
One 1536 vector is about 6.15 KB, meaning 54331450530 KB for everything, or 543 GB in vectors
Seems to have that whole sassy thing that Gonk (I really hate that name) is promising, I'll check it out I guess. How is it at answering questions accurately? There's no point if I have to search the validity anyway
Well in that example I gave it a very exaggerated prompt to give it that sarcasm, then I used RegEx to remove the source citations, but normally you can just click the citation links it includes to verify the info.
That's cool. I've been using this for the past 2 days and wow, it's really capable. Thanks for sharing!