this post was submitted on 21 Aug 2024
134 points (90.9% liked)

Technology

58303 readers
8 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

My original, editorialized title: Ars Technica Sells Out


Linking to this because I know people here read Ars Technica, and I totally didn't become a subscriber three days before this was announced. Nope. No sir.

you are viewing a single comment's thread
view the rest of the comments
[โ€“] [email protected] 15 points 3 months ago (1 children)

This is the logical endpoint for all the people who were complaining that scraping the open web for training is somehow immoral/illegal. Instead of stopping AI those with deep pockets will continue to train on everything while open source and small company efforts will be locked out.

[โ€“] [email protected] 2 points 3 months ago* (last edited 3 months ago)

Useful AI will be focused and narrow unless they actually achieve AGI.

Scraping literally the whole internet for inspiration is part of the reason they come up with utter rubbish. No one's actually scrutinizing what their ingesting. It's not so much a problem that they violate copyright it's more an issue that because they do it in this manner their output is garbage.

If these AI companies actually did some content curation we might get decent AI out of it.