this post was submitted on 21 May 2026
34 points (100.0% liked)

Technology

84817 readers
4035 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
top 3 comments
sorted by: hot top controversial new old
[–] A_norny_mousse@piefed.zip 1 points 20 minutes ago

Researchers tested how the services ChatGPT, Google Gemini, Google AI Overviews, Grok and Replika performed on a single day during the 2026 Scottish pre-election window and found:

  • One third (34.1%) of responses across chatbots contained factual errors, whilst reliability varied significantly across services
  • Errors included getting the date of election day wrong, giving wrong information about the need for voters to bring ID, “hallucinating” a candidate, and making up an expenses scandal on one occasion, and a nepotism scandal on another.

We reveal new evidence of the scale of these services’ unreliability during elections and make recommendations for the government to close the regulatory gap.

The last bit is the most important imo: Chatbots must not be allowed to present themselves as providers of information. Nor should any commercial/official body be allowed to rely on them.

[–] Deestan@lemmy.world 8 points 2 hours ago

Yeah, it's what they do. Generate convincing text. Calling it "errors" makes as much sense as claiming my dice "produced errors" when I lost at yahtzee.

An illustrative example: https://kucharski.substack.com/p/real-signals-or-artificial-stereotypes

"First, I’d created 2000 free-text responses and labelled them ‘UK’. Then I copied and pasted the exact same 2000 responses but labelled these ‘US’. Finally, I combined them to create a dataset of 4000 total responses, and jumbled them up.

Despite the responses being identical for the UK and US, Copilot produced a rich, detailed summary of how US and UK respondents differed."

[–] Crozekiel@piefed.zip 4 points 2 hours ago

LLMs making "errors" around elections might be the exact point of them.