‘Impossible’ to create AI tools like ChatGPT without copyrighted material, OpenAI says
(www.theguardian.com)
The difference here is that a child can't absorb and suddenly use massive amounts of data.
The act of learning is absorbing and using massive amounts of data. Almost any child can, for example, re-create copyrighted cartoon characters in their drawing or whistle copyrighted tunes.
If you look at pretty much any human-created work, you can trace elements of it back to many different sources. We usually call that "sources of inspiration". Of course, with human-created works it's not a big deal; generally it's considered transformative and fair use.
I really don't understand this whole "learning" thing that everybody claims these models are doing.
A Markov chain algorithm fed different text as input and producing the next predicted word as output isn't colloquially called "learning", yet it's fundamentally the same process, just less sophisticated.
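To make that concrete, here's a toy sketch of that kind of Markov-chain next-word prediction (the corpus and the starting word are made up for illustration): count which words follow each word, then sample from those counts.

```python
import random
from collections import defaultdict

# Toy corpus; a real chain would be built from a large body of text.
corpus = "the cat sat on the mat the cat ate the fish".split()

# First-order Markov chain: for each word, record which words follow it.
transitions = defaultdict(list)
for current, nxt in zip(corpus, corpus[1:]):
    transitions[current].append(nxt)

def next_word(word):
    """Predict the next word by sampling from the observed successors."""
    candidates = transitions.get(word)
    if not candidates:
        return random.choice(corpus)  # fall back to any word if unseen
    return random.choice(candidates)

# Generate a short continuation starting from "the".
word = "the"
output = [word]
for _ in range(8):
    word = next_word(word)
    output.append(word)
print(" ".join(output))
```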
They take input, apply a statistical model to it, and generate output derived from the input. Humans have creativity, lateral thinking and the ability to understand context and meaning. Most importantly, with art and creative writing, they're trying to express something.
"AI" has none of these things, just a probability for which token goes next considering which tokens are there already.
I don't think "learning" is a word reserved only for high-minded creativity. Even rote memorization and repetition is sometimes called learning, and there are many intermediate stages between the two.