science

A community to post scientific articles, news, and civil discussion.

rule #1: be kind

2024-11-11

founded 2 years ago

231

AI models fed AI-generated data quickly spew nonsense (www.nature.com)

submitted 7 months ago by [email protected] to c/[email protected]

[–] [email protected] 1 points 7 months ago (3 children)

As long as you verify that the output is correct before feeding it back, it's probably not bad.

[–] [email protected] 3 points 7 months ago (1 children)

How do you verify novel content generated by AI? How do you verify content harvested from the Internet to "be correct"?

[–] [email protected] 2 points 6 months ago

The same way you verified the input to begin with: human labor.

[–] [email protected] 3 points 7 months ago

That’s correct, and the paper supports this. But people don’t want to believe it’s true, so they keep propagating this myth.

Training on AI outputs is fine as long as you filter the outputs to only things you want to see.

[–] [email protected] 1 points 7 months ago

The issue is that A.I. always makes a certain number of mistakes when outputting something. It may be even the tiniest, most insignificant mistake. But if it internalizes that mistake, it'll make another mistake on top of the one it internalized, and so on.

Also, this is more of a concern with scraping in mind. The A.I. goes on the internet, scrapes other A.I. images because there are a lot of them now, and becomes worse.
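
The compounding described above can be illustrated with a toy simulation. This is only a sketch under stated assumptions, not the setup from the linked article: the "model" here is just a normal distribution fitted to its training data, and every generation is trained solely on samples drawn from the previous generation's fit, with no verification step. The function name `simulate_generations` and its parameters are hypothetical, chosen for illustration.

```python
import random
import statistics


def simulate_generations(generations=200, samples_per_generation=10, seed=0):
    """Toy model: repeatedly fit a normal distribution to its own samples.

    Returns the fitted standard deviation after each generation,
    starting with the original ("real data") value at index 0.
    """
    rng = random.Random(seed)  # local RNG so runs are reproducible
    mean, std = 0.0, 1.0       # assumed "real" data distribution
    history = [std]

    for _ in range(generations):
        # "Generate" content from the current model.
        data = [rng.gauss(mean, std) for _ in range(samples_per_generation)]
        # "Retrain" the next model on that generated content only.
        mean = statistics.fmean(data)
        std = statistics.pstdev(data)
        history.append(std)

    return history


if __name__ == "__main__":
    history = simulate_generations()
    for generation in (0, 50, 100, 200):
        print(f"generation {generation}: fitted std = {history[generation]:.3g}")
```

In this toy setting, each fit slightly misestimates the spread of the data, and the next generation inherits that error, so the fitted standard deviation tends to drift toward zero over many generations. Human verification or filtering of the generated samples is deliberately not modeled.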