this post was submitted on 08 May 2024
1716 points (99.4% liked)
Technology
58303 readers
18 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Maybe we need a technical questions and answers siteon the fediverse!
Not gonna stop your knowledge being fed to an AI.
Is there an actual way to stop it? I don't think so. At least, moving to the fediverse would stop any particular corporation from having the monopoly of it, prevent reddit-like abuse of power, would give users more power, among a few other things.
what about instances that need you to be logged in to view posts and require authorized requests for federation?
All it needs is an account to access troves of training data?
That should be manually approved
How restrictive do you want to be with the accounts? If you're too restrictive, there won't be enough users. If you're not restrictive enough, the data will be used for AI training.
That defeats the purpose of a knowledge base. The whole reason why everyone is using SO is that you don't need an account to access it and it's fully indexed by Google.
The real question is why the fuck are people ok with Google indexing SO and not OpenAI? Doesn't make any fucking sense.
Because Google is free and OpenAI isn't. It's one thing to take free content, index it, then allow anyone to access that index. It's another thing when you take free content, index it, then hide that index behind a paywall.
Are you sure? Because Google is not free at all, you're paying for it through privacy invasion and ads. While ChatGPT is actually free to use for end users - no ads, nothing.
https://openai.com/api/pricing/
No, it's free https://chatgpt.com/
As your link is for custom enterprise solutions, it's worth noting that Google has the same shit which also costs money https://cloud.google.com/pricing/
It's "freemium", not free. There is a difference. You can't use ChatGPT 4 without paying as well as the API. Also, you are limited in the number of prompts you can make per hour before you are put on pause and asked to pay.
Search engines like Ecosia, DuckDuckGo, etc. don't ask you for money. Regardless how intensively you use it. (They might come with other drawbacks though like Google with privacy, environment, ethical principles, ...)
It;s more free than Google.
I've never been asked to pay for using one of the aforementioned search engines. I have been asked to pay for OpenAI products.
So I don't see how you come to that conclusion.
Read the comments
The ones where you just claim that despite it being not true or which ones do you mean?
Not true? Ahaha! Good job spreading misinformation!
Well... as I said. OpenAI asks for money, search engines usually don't. Ergo, OpenAI is not free. (But freemium.)
Despite claiming that's not the case, you lack the necessary proof and don't seem to care about countering my argument with something of substance.
Such a discussion will not be fruitful if you are unwilling to deliver.
It's free, what else do you want?
That you deliver reasons for why you claim I'm wrong.
It's freemium, not free. As I said before, OpenAI limits the number of prompts you can make per hour in case you don't want to pay. Also, using the API or ChatGPT 4 costs money. Users of search engines are usually not asked for money.
What does Google's cloud service have to do with what we're discussing (Google indexing content vs. ~~SO~~ OpenAI doing it)? They're not even similar services.
Edit: SO -> OpenAI
The fuck are you talking about?
The price difference is that google steals your data. That's it. OpenAI steals data, ask for money to use most of their models, and buy even more data from other companies stealing user data (like google and SO). Also indexing web pages is not even the "stealing" part of google, it's just not comparable.
Yes, training AI on user data for free then selling the end product is a reasonable thing to be concerned about. It'd be different if the product was free or the data was sold to them with user consent.
SO has announced a subscription-based service trained on user data for free, and not only there's not even opt-out, they're mass-banning users for trying to "opt-out" manually. Tell me one thing here that's not completely fucked up.
But it's free. Unlike Google.
Nothing stopping them from scraping that too