this post was submitted on 04 Sep 2024
263 points (100.0% liked)

TechTakes

1428 readers
297 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] lvxferre@mander.xyz -2 points 2 months ago* (last edited 2 months ago) (4 children)

You could use them to know what the text is about, and if it's worth your reading time. In this situation, it's fine if the AI makes shit up, as you aren't reading its output for the information itself anyway; and the distinction between summary and shortened version becomes moot.

However, here's the catch. If the text is long enough to warrant the question "should I spend my time reading this?", it should contain an introduction for that very purpose. In other words if the text is well-written you don't need this sort of "Gemini/ChatGPT, tell me what this text is about" on first place.

EDIT: I'm not addressing documents in this. My bad, I know. [In my defence I'm reading shit in a screen the size of an ant.]

[–] queermunist@lemmy.ml 21 points 2 months ago* (last edited 2 months ago) (1 children)

ChatGPT gives you a bad summary full of hallucinations and, as a result, you choose not to read the text based on that summary.

[–] lvxferre@mander.xyz -5 points 2 months ago (1 children)

(For clarity I'll re-emphasise that my top comment is the result of misreading the word "documents" out, so I'm speaking on general grounds about AI "summaries", not just about AI "summaries" of documents.)

The key here is that the LLM is likely to hallucinate the claims of the text being shortened, but not the topic. So provided that you care about the later but not the former, in order to decide if you're going to read the whole thing, it's good enough.

And that is useful in a few situations. For example, if you have a metaphorical pile of a hundred or so scientific papers, and you only need the ones about a specific topic (like "Indo-European urheimat" or "Argiope spiders" or "banana bonds").

That backtracks to the OP. The issue with using AI summaries for documents is that you typically know the topic at hand, and you want the content instead. That's bad because then the hallucinations won't be "harmless".

[–] queermunist@lemmy.ml 14 points 2 months ago (1 children)

But the claims of the text are often why you read it in the first place! If you have a hundred scientific papers you're going to read the ones that make claims either supporting or contradicting your research.

You might as well just skim the titles and guess.

[–] dgerard@awful.systems 20 points 2 months ago

Both the use cases here are goverment documents. I'm baffled at the idea of it being "fine if the AI makes shit up".

[–] V0ldek@awful.systems 6 points 2 months ago

if the text is well-written you don’t need this sort of “Gemini/ChatGPT, tell me what this text is about” on first place.

And if it's badly written then the LLM will shit itself.

Now let's ask ourselves how much of the text in the world is "well-written"?

Or even better, you could apply this to Copilot. How much code in the world is good code? The answer is fucking none, mate.

[–] pikesley@mastodon.me.uk 6 points 2 months ago (1 children)
[–] lvxferre@mander.xyz 2 points 2 months ago (1 children)

No, it's just rambling. My bad.

I focused too much on using AI to summarise and ended not talking about it summarising documents, even if the text is about the later.

And... well, the later is such a dumb idea that I don't feel like telling people "the text is right, don't do that", it's obvious.

[–] dgerard@awful.systems 9 points 2 months ago

You'd think so, but guess what precise use case LLMs are being pushed hard for.