this post was submitted on 14 Oct 2024
110 points (95.8% liked)

Asklemmy

43948 readers
723 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy πŸ”

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_[email protected]~

founded 5 years ago
MODERATORS
 

Google, DuckDuckGo and Bing now all return the same shitty LLM-generated nonsense sites to most of my searches, and don't respect my literal search terms even when I put them in quotes.

I'm not ready to pay for search, yet.

Is there any alternative?

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 65 points 1 month ago (1 children)

What are you talking about? I just tried two test queries on DDG, and neither one had LLM-generated nonsense, and the one that was in double-quotes returned only five results, all of which had the double-quoted phrase and one of which was the thing I was challenging it to find.

Can you give an example of a query where DDG returns LLM results or doesn't respect your double-quotes?

[–] [email protected] 48 points 1 month ago (2 children)

I think they are referring to the search engines returning LLM content farm websites.

[–] [email protected] 7 points 1 month ago (4 children)

Maybe I’m a little out of the loop, what are llm content farm websites?

[–] [email protected] 35 points 1 month ago (2 children)

Low effort websites made easier by LLM generated text. It’s not new, just made easier with the ubiquity of LLM tools. Think of it as the latest generation of spam websites πŸ™ƒ

[–] [email protected] 15 points 1 month ago (1 children)

Ah I see. Junk β€˜news’ and other regurgitated blah. Yeah, I’d guess any free search engine will probably be bloated with that. Not to mention that it’s google, bing, and orange bing. Not a ton of crawlers out there indexing everything is there?

[–] [email protected] 4 points 1 month ago

The only other (not absolutely tiny) one I'm aware of is brave, but it has its own issues

[–] [email protected] 2 points 1 month ago* (last edited 1 month ago)

Ironically one of DDG's early selling points, before they fully jumped on the privacy bandwagon, was that they would filter out results for low-effort content farms (this was pre-LLM stuff).

I had used DDG since almost the beginning and it was one of the things I was originally sold on. It's difficult to find a source for it now but I did find this: https://web.archive.org/web/20110608072253/https://www.technologyreview.com/blog/post.aspx?bid=377&bpid=25532

[–] [email protected] 6 points 1 month ago

Forbes for example

[–] JackbyDev 5 points 1 month ago (1 children)

Think recipe websites that take forever to get to the recipe but it's for other topics. Like a simple question, "what is the release date for X new game?" And then there will be like 5+ paragraphs of jibber jabber about the game and then finally the last article will say when it releases.

This sort of site has been around for a while but supposedly they're more common nowadays. Personally I think people just have a better eye for things not written entirely by humans. Either way it's annoying to deal with them.

[–] [email protected] 5 points 1 month ago

Ugh I feel like I have been seeing more of that. Asked how many ml in a wine pour and got like 5 sites that wouldn’t just come out and say it. All kinds of gobbledegook dancing around the topic but no one would just freaking say it. 140ml in case you needed it

[–] [email protected] 4 points 1 month ago

Sites that mass-generate garbage using llms