this post was submitted on 04 Sep 2024

264 points (100.0% liked)

TechTakes

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

Don’t use AI to summarize documents — it’s worse than humans in every way (pivot-to-ai.com)

submitted 7 months ago by [email protected] to c/[email protected]

[–] RagnarokOnline 9 points 7 months ago (13 children)

I had GPT 3.5 break down 6x 45-minute verbatim interviews into bulleted summaries and it did great. I even asked it to anonymize people’s names and it did that too. I did re-read the summaries to make sure no duplicate info or hallucinations existed and it only needed a couple of corrections.

Beats manually summarizing that info myself.

Maybe their prompt sucks?

[–] [email protected] 8 points 7 months ago (1 children)

How did you make sure no hallucinations existed without reading the source material; and if you read the source material, what did using an LLM save you?

[–] RagnarokOnline 1 points 7 months ago

I conducted the interviews myself alongside a colleague. The summary was for reporting our findings up to leadership.

[+] [email protected] -7 points 7 months ago (2 children)

I also use it for that pretty often. I always double check and usually it's pretty good. Once in a great while it turns the summary into a complete shitshow but I always catch it on a reread, ask a second time, and it fixes things up. My biggest problem is that I'm dragged into too many useless meetings every week and this saves a ton of time over rereading entire transcripts and doing a poor job of summarizing because I have real work to get back to.

I also use it as a rubber duck. It works pretty well if you tell it what it's doing and tell it to ask questions.

[–] [email protected] 9 points 7 months ago (2 children)

Isn't the whole point of rubber duck debugging that the method works when talking to a literal rubber duck?

[–] [email protected] 8 points 7 months ago

what if your rubber duck released just an entire fuckton of CO2 into the environment constantly, even when you weren’t talking to it? surely that means it’s better

[–] RagnarokOnline 1 points 7 months ago

Yup! I’ll feed in meeting transcripts and get a list of action steps to email out to everyone. If I were in project management, I’m pretty sure I’d outsource my entire job to LLMs.

[–] [email protected] 5 points 7 months ago (2 children)

Is it only me, or is the linked article short on details and reaching a conclusion from just 2 examples? This is important & I need to hear more, & I’m generally biased against AI at this point, but the article isn’t doing enough to convince me.

[–] RagnarokOnline 2 points 7 months ago

This is hosted on awful.systems, so I get it, but why is any comment that’s even remotely pro-LLM getting downvoted into oblivion? Is gen-AI that unpopular?

[–] [email protected] -2 points 7 months ago* (last edited 7 months ago) (17 children)

You could use them to know what the text is about, and if it's worth your reading time. In this situation, it's fine if the AI makes shit up, as you aren't reading its output for the information itself anyway; and the distinction between summary and shortened version becomes moot.

However, here's the catch. If the text is long enough to warrant the question "should I spend my time reading this?", it should contain an introduction for that very purpose. In other words, if the text is well-written you don't need this sort of "Gemini/ChatGPT, tell me what this text is about" in the first place.

EDIT: I'm not addressing documents in this. My bad, I know. [In my defence I'm reading shit on a screen the size of an ant.]

[–] [email protected] 6 points 7 months ago (1 children)

@lvxferre @dgerard have you bumped your head?

[–] [email protected] 2 points 7 months ago (1 children)

No, it's just rambling. My bad.

I focused too much on using AI to summarise and ended up not talking about it summarising documents, even if the text is about the latter.

And... well, the latter is such a dumb idea that I don't feel like telling people "the text is right, don't do that", it's obvious.

[–] [email protected] 9 points 7 months ago

You'd think so, but guess what precise use case LLMs are being pushed hard for.

[–] [email protected] 6 points 7 months ago

if the text is well-written you don’t need this sort of “Gemini/ChatGPT, tell me what this text is about” in the first place.

And if it's badly written then the LLM will shit itself.

Now let's ask ourselves how much of the text in the world is "well-written"?

Or even better, you could apply this to Copilot. How much code in the world is good code? The answer is fucking none, mate.

[–] [email protected] -5 points 7 months ago* (last edited 7 months ago) (13 children)

The problem is not the LLMs, but what people are trying to do with them.

They are currently spoons, but people are desperately wishing they were katanas.

They work really well for soup, but they can't cut steak. But they're being hyped as super ninja steak knives, and people are getting pissed when they can't cut steak.

If you give them watery, soupy tasks they can do successfully, they can lighten your workload, as long as you're aware of what they are and aren't good at.

What people want LLMs to be able to do, i.e. "Steak" tasks:

write complex documents
apply complex knowledge/rules to a situation
write complex code and create entire programs based on a vague description

What LLMs can currently do, i.e. "Soup" tasks:

check this document and fix all spelling, punctuation and grammatical errors
summarise this paragraph as dot points
write a Python program that sorts my photographs into folders based on the year they were taken

Half of Lemmy is hyping katanas, the other half is yelling "Why won't my spoon cut this steak?!! AI is so dumb!!!"

Update: wow, the pure vitriol pouring out of the replies is just stunning. Seems there are a lot of you out there who have, in one way or another, tied your ego very strongly to either the success or failure of AI.

Take a step back, friends, and go outside for a while.

[–] [email protected] 9 points 7 months ago* (last edited 7 months ago) (2 children)

Food analogy

This level of discourse wouldn't fly on 4chan, how is it so popular with LLM fans?

[–] [email protected] 9 points 7 months ago (1 children)

needs to be a car analogy

What people want LLMs to do, i.e. Corvette tasks
What LLMs actually do, i.e. Trabant tasks

[–] [email protected] 6 points 7 months ago

What LLMs actually do, i.e. Trabant tasks

more of a Power Wheels Barbie Jeep whose battery got left out in the sun too long, but I’ll allow it

[–] [email protected] 5 points 7 months ago

don't diss the course, this steak's great

[–] [email protected] 9 points 7 months ago (8 children)

"spoons and katanas" has got to be the most baby brained analogy. are you a child