this post was submitted on 21 Aug 2024
134 points (90.9% liked)

Technology

58303 readers
11 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

My original, editorialized title: Ars Technica Sells Out


Linking to this because I know people here read Ars Technica, and I totally didn't become a subscriber three days before this was announced. Nope. No sir.

top 30 comments
sorted by: hot top controversial new old
[–] [email protected] 59 points 3 months ago* (last edited 3 months ago) (2 children)

The main problem now is that Ars Technica and all other Conde Nast publications, with it now having a vested interest in openAI (they're getting paid by them), can no longer be reliably trusted to report on any AI or AI-adjacent topic whatsoever. And every user comment and content is now owned by openAI.

[–] [email protected] 2 points 3 months ago (1 children)

I bet MFCBot would work super well if it was updated to report on if the publication's owners had a vested interest in the topic.

[–] [email protected] 1 points 3 months ago (1 children)

Oh it would. It would require the MFC site itself to actually collect and collate data on site's parent company investment portfolios and that's a pretty massive ask from what sounds like a very tiny team.

[–] [email protected] 1 points 3 months ago (1 children)

I wonder if it could somehow pull that data from somewhere like ground news somehow

[–] [email protected] 1 points 3 months ago

It might be able to? Ground has a browser extension that likely indicates they have some kind of external API access, it probably could be integrated with a little bit of effort.

[–] [email protected] 2 points 3 months ago (1 children)

I'm not so sure Ars has a vested interest in OpenAI. I actually read through 10 pages of comments. Ken Fisher was pretty active in them, and noted several times that Ars doesn't see any of the money from this deal.

[–] [email protected] 1 points 3 months ago (1 children)

Ars does not. Their chief editor has said as much. But Conde Nast absolutely does, and it WILL happen where CN will start telling Ars what they can and can't do, because that's how corporate ownership works. It happens every single time without fail, and that is why you can no longer trust anything they post related to AI. You toe the line to capital's interest, or CN is gonna eject you from the org.

[–] [email protected] 1 points 3 months ago

Conde Nast doesn't have an interest in driving away their readers, and AI bullshit absolutely will drive them away. They know this. Ken Fisher is the editorial lead for AI at Conde Nast (not just Ars), and said as much in the comment section.

AI is absolutely fucking things up on a grand scale in all sorts of industries, but as of right now, Ars is relatively safe and I don't think we need to inflate the scale of the danger.

[–] [email protected] 33 points 3 months ago (1 children)

Damnit! I still like and respect the Ars Technica staff but Condé Nast can piss off.

I feel for you KingThrillgore. I was thinking of supporting the site with a subscription but not after this. Still if enough people stop subscribing we may loose them altogether. This is a double edged shit sword.

[–] [email protected] 25 points 3 months ago* (last edited 3 months ago) (2 children)

I made it clear in my comment I was not happy about this after I became a subscriber. It will not auto-renew. If I had done it with a credit card and not Paypal, I'd try for a chargeback.

For what its worth, nobody else active on the site is happy about this, either. Lots of unsubscribes are being claimed in the comments (including mine).

Edit: and a comment from the EIC says CN will not share revenue from OpenAI with Ars Technica. Incredible. They're basically getting nothing out of it.

[–] [email protected] 4 points 3 months ago

Understood and have seen the comments. I don’t blame you for being upset. It’s a crappy situation.

[–] fuzzzerd 1 points 3 months ago

Yeah. That post has over 900 comments, which is rare even for an Ars article. Mostly negative sentiment, which I get.

[–] [email protected] 17 points 3 months ago* (last edited 3 months ago) (2 children)

llms suck because they steal content and are unreliable since they don't link back to sources

Open ai makes a deal to pay media org for there content and makes it so they can link back to original article

"Ars technica sold out"

[–] [email protected] 8 points 3 months ago* (last edited 3 months ago) (1 children)

Condé Nast didn't just sell access to their subsidiaries' content, but also to the user generated content on those subsidiaries' sites. That's at issue here.

It also has a possibility to cause a conflict of interest for Ars Technica to write about OpenAI. That's the second issue here.

And, as per the editor in chief, the money doesn't go to Ars Technica, but to Condé Nast.

[–] [email protected] -4 points 3 months ago* (last edited 3 months ago)

Yes they sold access to the user content we’ve generated after we explicitly agreed to the fact that they may do so. If you’ve chosen to not read the fine print when you created an account and created content for them, that’s sort of up to you tbh.

All media companies have owners and potential conflicts of interest. Arstechnica (Conde Nast) is no different. They’ve explicitly called out any potential for conflict of interest when it has arisen in the past.

Of course the money goes to Conde Nast, they own the brand Ars Technica and employ the people who write for it; that doesn’t mean it doesn’t figure on Ars Technica’s budget when Conde Nast decides whether to continue paying the salary of the staff.

[–] [email protected] 2 points 3 months ago

It's funny because you're making the opposite point of the one you think you're making. Cause if you put together the two pieces of information from your comment, the entire picture is:

Open ai makes a deal to pay media org for there content and makes it so they can link back to original article, with the money they make from stealing everybody else's content

That's already pretty bad, even without that points you neglected to mention, like how some of the content that is indirectly making money for Ars Technica is stolen from their competitors, or how Ars Technica basically became a worthless journalistic source for AI at a time where public opinion is not yet settled on its morality and precedent has not been set on its legality. How is this not "sold out" to you?

[–] [email protected] 16 points 3 months ago (2 children)

I understand them. If they refused for integrity reasons, openai would steal their content anyways via scrapers.

Suing them for copyright infringement, even if is the desire we all have, is ultra expensive.

I would also have signed that deal with the devil...

[–] [email protected] 3 points 3 months ago

I would have signed it too, but only because I'm awesome on fiddle and I could totally win my soul back.

[–] [email protected] 2 points 3 months ago

Either you sell your soul for something, or get it taken from you, leaving you with nothing.

[–] [email protected] 15 points 3 months ago (1 children)

This is the logical endpoint for all the people who were complaining that scraping the open web for training is somehow immoral/illegal. Instead of stopping AI those with deep pockets will continue to train on everything while open source and small company efforts will be locked out.

[–] [email protected] 2 points 3 months ago* (last edited 3 months ago)

Useful AI will be focused and narrow unless they actually achieve AGI.

Scraping literally the whole internet for inspiration is part of the reason they come up with utter rubbish. No one's actually scrutinizing what their ingesting. It's not so much a problem that they violate copyright it's more an issue that because they do it in this manner their output is garbage.

If these AI companies actually did some content curation we might get decent AI out of it.

[–] [email protected] 12 points 3 months ago (1 children)

Their EIC is in the comments replying to people. He seems buttoned up about it (understandably) but I get the impression he’s not thrilled

[–] [email protected] 2 points 3 months ago

Definitely not thrilled, and he says in the comments that ars does not benefit financially from this.

[–] [email protected] 7 points 3 months ago

Never trust Condé Nast to do the right by its consumers. That's a tale decades old at this point.

[–] [email protected] 6 points 3 months ago (1 children)

what’s the problem here? openai isn’t pirating content if they pay for it,? have i misunderstood something

[–] [email protected] 3 points 3 months ago* (last edited 3 months ago)

ai new

new bad

remember old time

old time good

[–] [email protected] 6 points 3 months ago

I want Ars content to be part of whatever training data is provided to the best models. How does that get done without appearing like they are being bought?

Even if their contract explicitly states that it is a data sharing agreement only and the products of the media organization (articles/investigations) are not grounds for breach or retaliation, it is assumed that there is now some impartiality in future reporting.

So, for all media companies, the options seem to be:

  1. Contribute to the greater good by openly permitting site scraping (for $0)
  2. Allow data sharing to contracted parties only (for a fee)
  3. Public or privately prohibit use of any data, and then seek damages down the road for theft/copyright infringement when the legal framework has been established.

Is there a GPL or other license structure that permits data sharing for LLM training in a way that it does not get transformed into something evil?