this post was submitted on 28 Oct 2024
1507 points (98.9% liked)

Technology

58997 readers
4309 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 319 points 3 days ago* (last edited 3 days ago) (32 children)

As a fervent AI enthusiast, I disagree.

...I'd say it's 97% hype and marketing.

It's crazy how much fud is flying around, and legitimately buries good open research. It's also crazy what these giant corporations are explicitly saying what they're going to do, and that anyone buys it. TSMC's allegedly calling Sam Altman a 'podcast bro' is spot on, and I'd add "manipulative vampire" to that.

Talk to any long-time resident of localllama and similar "local" AI communities who actually dig into this stuff, and you'll find immense skepticism, not the crypto-like AI bros like you find on linkedin, twitter and such and blot everything out.

[–] [email protected] 95 points 3 days ago (2 children)

For real. Being a software engineer with basic knowledge in ML, I'm just sick of companies from every industry being so desperate to cling onto the hype train they're willing to label anything with AI, even if it has little or nothing to do with it, just to boost their stock value. I would be so uncomfortable being an employee having to do this.

[–] [email protected] 31 points 3 days ago (1 children)

For sure, it seems like 90% of ai startups are nothing more than front end wrappers for a gpt instance.

[–] [email protected] 19 points 3 days ago* (last edited 3 days ago) (1 children)

They're all built on top of OpenAI which is very unprofitable at the moment. Feels like the whole industry is built on a shaky foundation.

Putting the entire fate of your company in a different company (OpenAI) is not a great business move. I guess the successful AI startups will eventually transition to self-hosted models like Llama, if they survive that long.

[–] [email protected] 6 points 3 days ago (1 children)

Most projects I've been in contact with are very aware of that fact. That's why telemetry is so big right now. Everybody is building datasets in the hopes of fine tuning smaller, cheaper models once they have enough good quality data.

[–] [email protected] 6 points 3 days ago (2 children)

My company is realizing that hosting a model which will be private, cost-effective, and performing better than traditional algorithms is like finding a unicorn. Few months back, the top execs were jumping around GenAI like a bunch of kids. Fortunately, the Sr. research head beat some sense into them.

[–] [email protected] 2 points 2 days ago

You're lucky there's a higher up that could talk down the even higher ups. Though, sometimes it's not even about the r&d teams.

I saw company wide HR educational emails or courses telling you how to improve you work quality/efficiency, and one of them tells us to "research AI" and learn how to utilize it, talking about how great it is and improved the work efficiency by 30%. Sure, it has its uses, but I won't go touting how great it is. And with how ChatGPT works, you have to be the biggest idiot in the world to upload all your sensitive stuff to ChatGPT just for it to make a spreadsheet faster. But without these disclaimers in the email, I doubt regular clerical staff knows about this, and it's extremely dangerous.

[–] [email protected] 1 points 2 days ago (1 children)

What kind of use-cases was it, where you didn't find suitable local models to work with ? I've found that general "chatbot" things are hit and miss but more domain-constrained tasks (such as extracting structured entities from unstructured text) are pretty reliable even on smaller models. I'm not counting my chickens yet as my dataset is still somewhat small but preliminary testing has been very promising in that regard.

[–] [email protected] 2 points 2 days ago (1 children)

What kind of use-cases was it, where you didn’t find suitable local models to work with ?

Any time you ask very domain specific questions; eg "i have collected some soil samples from the mesolithic age near the Amazon basin which have high sulfur and phosphorus content compared to my other samples. What factors could contribute to this distribution?", both of-the-shelf local models & OpenAI fail.

The main reason is because these models are not trained on highly-specialized domains of text. Sometimes the models start hallucinating and which reduces our trust upon them.

[–] [email protected] 2 points 2 days ago

“i have collected some soil samples from the mesolithic age near the Amazon basin which have high sulfur and phosphorus content compared to my other samples. What factors could contribute to this distribution?”

Haha yeah the top execs were tripping balls if they thought some off-the-shelf product would be able to answer this kind of expert questions. That's like trying to replace an expert craftsman with a 3D printer.

[–] [email protected] 6 points 3 days ago (1 children)

As someone who was working really hard trying to get my company to be able use some classical ML (with very limited amounts of data), with some knowledge on how AI works, and just generally want to do some cool math stuff at work, being asked incessantly to shove AI into any problem that our execs think are “good sells” and be pressured to think about how we can “use AI” was a terrible feel. They now think my work is insufficient and has been tightening the noose on my team.

[–] [email protected] 3 points 3 days ago

This. Exactly.

[–] [email protected] 28 points 3 days ago (1 children)

TSMC are probably making more money than anyone in this goldrush by selling the shovels and picks, so if that's their opinion, I feel people should listen...

There's little in the AI business plan other than hurling money at it and hoping job losses ensue.

[–] [email protected] 9 points 3 days ago

TSMC doesn't really have official opinions, they take silicon orders for money and shrug happily. Being neutral is good for business.

Altman's scheme is just a whole other level of crazy though.

[–] [email protected] 16 points 3 days ago (1 children)

I think we should indict Sam Altman on two sets of charges:

  1. A set of securities fraud charges.

  2. 8 billion counts of criminal reckless endangerment.

He's out on podcasts constantly saying the OpenAI is near superintelligent AGI and that there's a good chance that they won't be able to control it, and that human survival is at risk. How is gambling with human extinction not a massive act of planetary-scale criminal reckless endangerment?

So either he is putting the entire planet at risk, or he is lying through his teeth about how far along OpenAI is. If he's telling the truth, he's endangering us all. If he's lying, then he's committing securities fraud in an attempt to defraud shareholders. Either way, he should be in prison. I say we indict him for both simultaneously and let the courts sort it out.

[–] [email protected] 3 points 2 days ago

"When you're rich, they let you do it."

[–] [email protected] 20 points 3 days ago

Seriously, I'd love to be enthusiastic about it because it's genuinely cool what you can do with math.

But the lies that are shoved in our faces are just so fucking much and so fucking egregious that it's pretty much impossible.

And on top of that LLMs are hugely overshadowing actual interesting approaches for funding.

[–] [email protected] 16 points 3 days ago (2 children)

I really want to like AI, I’d love to have an intelligent AI assistant or something, but I just struggle to find any uses for it outside of some really niche cases or for basic brainstorming tasks. Otherwise, it just feels like alot of work for very little benefit or results that I can’t even trust or use.

[–] [email protected] 10 points 3 days ago* (last edited 3 days ago) (1 children)

It's useful.

I keep Qwen 32B loaded on my desktop pretty much whenever its on, as an (unreliable) assistant to analyze or parse big texts, to do quick chores or write scripts, to bounce ideas off of or even as a offline replacement for google translate (though I specifically use aya 32B for that).

It does "feel" different when the LLM is local, as you can manipulate the prompt syntax so easily, hammer it with multiple requests that come back really fast when it seems to get something wrong, not worry about refusals or data leakage and such.

[–] [email protected] 3 points 3 days ago (1 children)

Attractive. You got some pretty solid specs?

Rue the day I cheaped out on RAM. soldered RAMmmm

[–] [email protected] 1 points 3 days ago* (last edited 3 days ago)

Soldered is better! It's sometimes faster, definitely faster if it happens to be lpddr.

But TBH the only thing that really matters his "how much VRAM do you have," and Qwen 32B slots in at 24GB, or maybe 16GB if the GPU is totally empty and you tune your quantization carefully. And the cheapest way to that (until 2025) is a used MI60, P40 or 3090.

[–] [email protected] 2 points 3 days ago* (last edited 3 days ago) (2 children)

I receive alerts when people are outside my house, using security cameras, Blue Iris, CodeProject AI, Node-RED and Home Assistant, using a Google Coral for local AI. Entirely local - no cloud services apart from Google's notification system to get notifications to my phone while I'm not home (which most Android apps use). That's a good use case for AI since it avoids false positives that occur with regular motion detection.

load more comments (2 replies)
[–] [email protected] 13 points 3 days ago (1 children)

The saddest part is, this is going to cause yet another AI winter. The first few ones were caused by genuine over-enthusiasm but this one is purely fuelled by greed.

[–] [email protected] 9 points 3 days ago

The AI ecosystem is flooded, we need a good bubble pop to slow down the massive waste of resources that our current info-remix-based-on-what-you-will-likely-react-positively-to shit-tier AI represents.

[–] [email protected] 8 points 3 days ago (1 children)

Agreed that’s why it’s so dangerous. These tech bros are going to do damage with their shitty products. It seems like it's Altman's goal, honestly.

[–] [email protected] 7 points 3 days ago

He wants money/power, and he is getting it. The rest of the AI field will forever be haunted by his greed.

[–] [email protected] 5 points 3 days ago (4 children)

After getting my head around the basics of the way LLMs work I thought "people rely on this for information?", the model seems ok for tasks like summarisation though

[–] [email protected] 8 points 3 days ago

I don’t love it for summarization. If I read a summary, my takeaway may be inaccurate.

Brainstorming is incredible. And revision suggestions. And drafting tedious responses, reformatting, parsing.

In all cases, nothing gets attributed to me unless I read every word and am in a position to verify the output. And I internalize nothing directly, besides philosophy or something. Sure can be an amazing starting point especially compared to a blank page.

load more comments (3 replies)
[–] [email protected] 3 points 3 days ago (1 children)

TSMC's allegedly calling Sam Altman a 'podcast bro' is spot on, and I'd add "manipulative vampire" to that.

What's the source for that? It sounds hilarious

[–] [email protected] 12 points 3 days ago

https://web.archive.org/web/20240930204245/https://www.nytimes.com/2024/09/25/business/openai-plan-electricity.html

When Mr. Altman visited TSMC’s headquarters in Taiwan shortly after he started his fund-raising effort, he told its executives that it would take $7 trillion and many years to build 36 semiconductor plants and additional data centers to fulfill his vision, two people briefed on the conversation said. It was his first visit to one of the multibillion-dollar plants.

TSMC’s executives found the idea so absurd that they took to calling Mr. Altman a “podcasting bro,” one of these people said. Adding just a few more chip-making plants, much less 36, was incredibly risky because of the money involved.

[–] [email protected] 3 points 3 days ago (2 children)

It's selling the future, but nobody knows if we can actually get there

[–] [email protected] 6 points 3 days ago

It's selling an anticompetitive dystopia. It's selling a Facebook monopoly vs selling the Fediverse.

We dont need 7 trillion dollars of datacenters burning the Earth, we need collaborative, open source innovation.

[–] [email protected] 3 points 3 days ago

The first part is true .... no one cares about the second part of your statement.

load more comments (22 replies)