this post was submitted on 26 Jul 2024
231 points (97.1% liked)
science
15077 readers
114 users here now
A community to post scientific articles, news, and civil discussion.
rule #1: be kind
<--- rules currently under construction, see current pinned post.
2024-11-11
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
As long as you verify the output to be correct before feeding it back is probably not bad.
How do you verify novel content generated by AI? How do you verify content harvested from the Internet to "be correct"?
Same way you verified the input to begin with. Human labor
That’s correct, and the paper supports this. But people don’t want to believe it’s true so they keep propagating this myth.
Training on AI outputs is fine as long as you filter the outputs to only things you want to see.
The issue is that A.I. always does a certain amount of mistakes when outputting something. It may even be the tiniest, most insignificant mistake. But if it internalizes it, it'll make another mistake including the one it internalized. So on and so forth.
Also this is more with scraping in mind. So like, the A.I. goes on the internet, scrapes other A.I. images because there's a lot of them now, and becomes worse.