this post was submitted on 30 Oct 2024

347 points (91.6% liked)

Ask Lemmy

28031 readers

1533 users here now

A Fediverse community for open-ended, thought provoking questions

Rules: (interactive)

1) Be nice and; have fun

Doxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can't say something nice, don't say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them

2) All posts must end with a '?'

This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?

3) No spam

Please do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.

4) NSFW is okay, within reason

Just remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either [email protected] or [email protected]. NSFW comments should be restricted to posts tagged [NSFW].

5) This is not a support community.

It is not a place for 'how do I?', type questions. If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email [email protected]. For other questions check our partnered communities list, or use the search function.

6) No US Politics.

Please don't post about current US Politics. If you need to do this, try [email protected] or [email protected]

Reminder: The terms of service apply here too.

Partnered Communities:

Logo design credit goes to: tubbadu

founded 2 years ago

MODERATORS

[email protected]

347

Do any non-corpos actually like AI slop? (lemmy.world)

submitted 3 months ago by [email protected] to c/[email protected]

286 comments fedilink hide all child comments

I've found that AI has done literally nothing to improve my life in any way and has really just caused endless frustrations. From the enshitification of journalism to ruining pretty much all tech support and customer service, what is the point of this shit?

I work on the Salesforce platform and now I have their dumbass account managers harassing my team to buy into their stupid AI customer service agents. Really, the only AI highlight that I have seen is the guy that made the tool to spam job applications to combat worthless AI job recruiters and HR tools.

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 5 points 3 months ago (4 children)

It's great at summarization and translations.

[–] [email protected] 12 points 3 months ago (2 children)

Might want to rethink the summarization part.

AI also hasn’t made any huge improvements in machine translation AFAIK. Translators still get hired because AI can’t do the job as well.

[–] [email protected] 5 points 3 months ago

Thank you for pointing that out. I don't use it for anything critical, and it's been very useful because Kagi's summarizer works on things like YouTube videos friends link which I don't care enough to watch. I speak the language pair I use DeepL on, but DeepL often writes more natively than I can. In my anecdotal experience, LLMs have greatly improved the quality of machine translation.

[–] [email protected] 1 points 3 months ago* (last edited 3 months ago) (1 children)

The AI summaries were judged significantly weaker across all five metrics used by the evaluators, including coherency/consistency, length, and focus on ASIC references. Across the five documents, the AI summaries scored an average total of seven points (on ASIC's five-category, 15-point scale), compared to 12.2 points for the human summaries.

The focus on the (now-outdated) Llama2-70B also means that "the results do not necessarily reflect how other models may perform" the authors warn.

to assess the capability of Generative AI (Gen AI) to summarise a sample of public submissions made to an external Parliamentary Joint Committee inquiry, looking into audit and consultancy firms

In the final assessment ASIC assessors generally agreed that AI outputs could potentially create more work if used (in current state), due to the need to fact check outputs, or because the original source material actually presented information better. The assessments showed that one of the most significant issues with the model was its limited ability to pick-up the nuance or context required to analyse submissions.

The duration of the PoC was relatively short and allowed limited time for optimisation of the LLM.

So basically this study concludes that Llama2-70B with basic prompting is not as good as humans at summarizing documents submitted to the Australian government by businesses, and its summaries are not good enough to be useful for that purpose. But there are some pretty significant caveats here, most notably the relative weakness of the model they used (I like Llama2-70B because I can run it locally on my computer but it's definitely a lot dumber than ChatGPT), and how summarization of government/business documents is likely a harder and less forgiving task than some other things you might want a generated summary of.

[–] [email protected] 1 points 3 months ago (1 children)

Please share any studies you have showing AI is better than a person at summarizing complex information.

[–] [email protected] 1 points 3 months ago (1 children)

If it wasn't clear, I am not claiming that AI is better than a person at summarizing complex information.

[–] [email protected] 2 points 3 months ago

My bad for misunderstanding you.

[–] [email protected] 11 points 3 months ago (1 children)

LLMs are TERRIBLE at summarization

[–] [email protected] 4 points 3 months ago* (last edited 3 months ago)

Downvoters need to read some peer reviewed studies and not lap up whatever BS comes from OpenAI who are selling you a bogus product lmao. I too was excited for summarization use-case of AI when LLMs were the new shiny toy, until people actually started testing it and got a big reality check

[–] [email protected] 11 points 3 months ago (1 children)

Until it makes shit up that the original work never said.

[–] [email protected] 5 points 3 months ago (1 children)

The services I use, Kagi's autosummarizer and DeepL, haven't done that when I've checked. The downside of the summarizer is that it might remove some subtle things sometimes that I'd have liked it to keep. I imagine that would occur if I had a human summarize too, though. DeepL has been very accurate.

[–] [email protected] 4 points 3 months ago

LLMs are especially bad for summarization for the use case of presenting search results. The source is just as critical of information for search as the information itself, and LLMs obfuscate this critical source information and combine results from multiple sources together...

[–] [email protected] 2 points 3 months ago (1 children)

tl;dr?

[–] [email protected] 1 points 3 months ago

Translates Sumerian texts.