this post was submitted on 04 Aug 2023
43 points (100.0% liked)
Beehaw Support
2796 readers
6 users here now
Support and meta community for Beehaw. Ask your questions about the community, technical issues, and other such things here.
A brief FAQ for lurkers and new users can be found here.
Our September 2024 financial update is here.
For a refresher on our philosophy, see also What is Beehaw?, The spirit of the rules, and Beehaw is a Community
This community's icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I'm just trying to understand, generally speaking, how the bot works. It appears to me that the bot is looking for posted articles that exceed a certain word count threshold. If it finds these, then it creates a summary and posts this as a comment. Am I understanding this correctly?
It has support for specific news sites, I don't want to rely on some automatic text extraction because those are prone to breaking. Here are the content extractors themselves, each for one site. If a post that contains a link to any of the supported sites is found across all of Lemmy (that the bot can see), it extracts the text and then summarizes it using this. It takes 6 sentences directly from the article that look most important to the machine learning model it uses. Then it posts it as a comment.