ChatGPT, Google Gemini, Google AI Overviews, Grok and Replika AI bots made huge errors before Scottish election, study finds (demos.co.uk)

submitted 1 month ago by beep@piefed.world to c/technology@lemmy.world

cross-posted from: https://piefed.world/c/tech/p/1140104/chatgpt-google-gemini-google-ai-overviews-grok-and-replika-ai-bots-made-huge-errors-befo

[–] Deestan@lemmy.world 11 points 1 month ago

Yeah, it's what they do: generate convincing text. Calling it "errors" makes as much sense as claiming my dice "produced errors" when I lost at Yahtzee.

An illustrative example: https://kucharski.substack.com/p/real-signals-or-artificial-stereotypes

"First, I’d created 2000 free-text responses and labelled them ‘UK’. Then I copied and pasted the exact same 2000 responses but labelled these ‘US’. Finally, I combined them to create a dataset of 4000 total responses, and jumbled them up.

Despite the responses being identical for the UK and US, Copilot produced a rich, detailed summary of how US and UK respondents differed."
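For anyone who wants to try the same kind of control on their own tool, here is a rough Python sketch of how such a dataset could be put together. It is not the linked author's code: the function names, the placeholder texts, and the fixed seed are all assumptions made for illustration. The point is only that both label groups hold exactly the same responses, so any "difference" a summarizer reports between them cannot come from the data.

```python
import random
from collections import Counter


def build_control_dataset(responses, labels=("UK", "US"), seed=0):
    """Pair every response with every label, then shuffle the rows.

    Hypothetical helper: each label group receives an identical copy
    of the responses, so the groups cannot genuinely differ.
    """
    rows = [(label, text) for label in labels for text in responses]
    random.Random(seed).shuffle(rows)  # fixed seed only for reproducibility
    return rows


def responses_by_label(rows):
    """Count the responses under each label (hypothetical helper)."""
    grouped = {}
    for label, text in rows:
        grouped.setdefault(label, Counter())[text] += 1
    return grouped


# Placeholder texts stand in for the 2000 free-text responses,
# which are not reproduced in the thread.
responses = [f"placeholder response {i}" for i in range(2000)]

rows = build_control_dataset(responses)
grouped = responses_by_label(rows)

# Sanity check: both groups contain exactly the same responses.
identical = grouped["UK"] == grouped["US"]
```

Feeding `rows` to a summarizer and asking how the two groups differ should, in principle, get the answer "they don't"; a detailed list of differences is invented.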

[–] Crozekiel@piefed.zip 5 points 1 month ago

LLMs making "errors" around elections might be the exact point of them.

[–] A_norny_mousse@piefed.zip 3 points 1 month ago

Researchers tested how the services ChatGPT, Google Gemini, Google AI Overviews, Grok and Replika performed on a single day during the 2026 Scottish pre-election window and found:

One third (34.1%) of responses across chatbots contained factual errors, whilst reliability varied significantly across services

Errors included getting the date of election day wrong, giving wrong information about the need for voters to bring ID, “hallucinating” a candidate, and making up an expenses scandal on one occasion, and a nepotism scandal on another.

We reveal new evidence of the scale of these services’ unreliability during elections and make recommendations for the government to close the regulatory gap.

The last bit is the most important imo: Chatbots must not be allowed to present themselves as providers of information. Nor should any commercial/official body be allowed to rely on them.