Microblog Memes

5467 readers

1 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

Rules:

Please put at least one word relevant to the post in the post title.
Be nice.
No advertising, brand promotion or guerilla marketing.
Posters are encouraged to link to the toot or tweet etc in the description of posts.

Related communities:

founded 1 year ago

MODERATORS

[email protected]

1555

Amazon: We can't make money if our workers get bathroom breaks (media.mas.to)

submitted 9 months ago by [email protected] to c/[email protected]

85 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 17 points 9 months ago (1 children)

Public data still have licenses. Eg, some open source licences force you to open source the software you created using them, something OpenAI doesn't do.

[–] [email protected] 1 points 9 months ago (1 children)

If you're using it as you found it, then yeah. But if I take derived data from it like word count and word frequency, it's not exactly the same thing and we call that statistics. Now if I draw associations of how often certain words appear together, and then compound that with millions of other sources to create a map of related words and concepts, I'm no longer using the data as you described because I'm doing something entirely different with it. What LLMs do is generates new information from its underlying sources.

[–] [email protected] 4 points 9 months ago (1 children)

In my example, they would still be using the source code to create new software that is not open source, not matter how many Markov chains are behind it.

[–] [email protected] -3 points 9 months ago* (last edited 9 months ago)

That's really stretching it, tbh. You're arguing that the cake is made of chicken because it contained whole eggs at some point.