this post was submitted on 02 Mar 2026
418 points (98.4% liked)

Fuck AI

7069 readers
1956 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[โ€“] pkjqpg1h@lemmy.zip 10 points 3 months ago (1 children)

According to the AA-Omniscience benchmark

The most expensive models,

Opus 4.6 has a 60% hallucination rate and 46% accuracy rate. Gemini 3.1 Pro Preview has a 50% hallucination rate and 55% accuracy rate.

And the questions aren't even open-ended.

I don't even need to tell you about the other models.

[โ€“] LodeMike@lemmy.today 4 points 3 months ago* (last edited 3 months ago)

"Opus 4.6" like every other LLM has a 100% hallucination rate because that's the literal only thing they do.