this post was submitted on 20 Jan 2026
613 points (98.7% liked)

Fuck AI


"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago
[–] dumbass@piefed.social 64 points 15 hours ago (8 children)

Wikipedia is one of the last genuine places on the Internet, and these rat bastards are trying to contaminate that, too

Wikipedia just sold the rights to use Wikipedia for AI training to Microsoft and OpenAI...

[–] udon@lemmy.world 12 points 12 hours ago (3 children)

How exactly does that work? Wikipedia does not "own" the content on the website; it's all CC BY-SA licensed.

[–] Alcoholicorn@mander.xyz 11 points 15 hours ago (15 children)

Why? Wikipedia has like a decade of operating expenses on hand, so they don't need the money

[–] surewhynotlem@lemmy.world 31 points 15 hours ago

This number inflates every time I read it. First it was ten years of hosting cost. Then it's operating costs. Soon it will be ten years of the entire US GDP.

I'd believe they have ten years of hosting costs on hand.

My quick googling says they have 170m in assets and around 180m in annual operating costs, give or take.

[–] Lumidaub@feddit.org 174 points 17 hours ago (2 children)

Seeing as OpenAI struggled to make its AI avoid the em dash and still hasn't entirely managed to do it, I'm not too worried.

[–] FiniteBanjo@feddit.online 81 points 17 hours ago (1 children)

TBF OpenAI are a bunch of idiots running the world's largest ponzi scheme. If DeepMind tried it and failed then...

Well I still wouldn't be surprised, but at least it would be worth citing.

[–] chickenf622@sh.itjust.works 34 points 16 hours ago (4 children)

I think the underlying issue is that current "AI" is inherently non-deterministic, so it's impossible to fix these issues totally. You can feed an AI all the data on how to not sound like AI, but you need massive amounts of non-AI writing to reinforce that. With AI being so prevalent, you can't guarantee that a dataset is AI-free anymore, so you get the old "garbage in, garbage out" problem that AI companies cannot solve. I still think generative AI has its place as a tool; I use it for quick and dirty text manipulation. But it's being applied to every problem we have like it's a magic silver bullet. I'm ranting at this point, so I'm going to stop here.

[–] FiniteBanjo@feddit.online 21 points 16 hours ago (11 children)

I honestly disagree that it has any use. Being a statistical model with high variance makes it a liability: no matter which task you use it for, it will produce worse results than a human being and will create new problems that didn't exist before.

[–] hector@lemmy.today 1 points 9 hours ago (1 children)

AI is useful for sorting datasets and pulling relevant info in some cases, e.g. ProPublica has used it for articles.

Obviously that was simple sorting for them. Case law is too complicated for that kind of data sifting; it was trained on Reddit, after all.

[–] FiniteBanjo@feddit.online 1 points 8 hours ago (1 children)

And when, not if but when, it makes a mistake by pulling hallucinated info or data, then it's going to be your fault. That's why it's a liability.

[–] hector@lemmy.today 1 points 8 hours ago

As for the simple stuff it can do: I'm trying to remember how ProPublica used it, but it was just like sifting through a database and pulling out all mentions of a word.

When you get into giving case law, it's way too complicated for it and it hallucinates.

[–] Cethin@lemmy.zip 3 points 13 hours ago* (last edited 13 hours ago)

If you're running it locally you can set how much variance it has. However, I mostly agree, in that it creates a bunch of trash. This doesn't mean it has no use, though. It's like the monkeys-on-typewriters thought experiment, but the monkey's output is fairly constrained, so it takes far fewer attempts to create what you want. Whether it'll come up with a good solution in a reasonable number of tries depends on the complexity of the solution required. If it needs a novel solution, it probably never will, because it's constrained to solutions it's seen before.
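
For anyone wondering what "setting the variance" looks like in practice: the usual knob is a sampling temperature. Here's a rough sketch, assuming the model hands you raw scores (logits) for each candidate token. The function name and the three example scores are made up for illustration and don't come from any particular runtime.

```python
import math
import random

def sample_with_temperature(logits, temperature, rng):
    """Pick an index from `logits` after temperature scaling (hypothetical helper)."""
    if temperature <= 0:
        # Treat zero temperature as greedy decoding: always take the top score.
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max before exp() for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]

# Made-up scores for three candidate next tokens.
logits = [2.0, 1.0, 0.1]
rng = random.Random(0)

# Temperature 0: the same token every time.
greedy = {sample_with_temperature(logits, 0.0, rng) for _ in range(100)}
# High temperature: the distribution flattens, so every token shows up.
hot = {sample_with_temperature(logits, 5.0, rng) for _ in range(1000)}
```

Low temperature gets you repeatable but samey output; high temperature gets you variety along with more of the trash.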

[–] hector@lemmy.today 1 points 9 hours ago* (last edited 9 hours ago)

We should crowdsource a program, for companies to employ, that sniffs out AI data crawlers and then poisons the data they harvest without them knowing.

[–] homura1650@lemmy.world 3 points 12 hours ago

Datasets are not the only mechanism to train AI. You can also use reinforcement learning. This requires you to have a good fitness function. In some domains, that is not a problem. For LLMs, however, we do not have such a function. What we can do is use a hybrid approach, where we train a model on a data set while also optimizing for fitness functions that address part of what we want (e.g. avoiding em dashes). In practice, this tends to be tricky, as ML tends to be a bit too good at optimizing for fitness functions, and will often do it in ways you don't want. This is why, if you want to develop a real AI product, you actually need AI engineers who know what they are doing, not prompt engineers who will try to find the magic incantation that makes someone else's AI do what they want.
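
To make the "too good at optimizing" part concrete, here's a toy sketch. It is not how any lab actually scores outputs; the fitness function, the candidate strings and their labels are all invented for illustration. The point is only that a narrow fitness function can be maxed out by answers nobody wanted.

```python
EM_DASH = "\u2014"  # the character we want the model to stop overusing

def em_dash_fitness(text):
    """Toy fitness: 1.0 with no em dashes, lower as they get denser."""
    if not text:
        return 1.0  # an empty answer trivially contains no em dashes
    return 1.0 - text.count(EM_DASH) / len(text)

# Hypothetical model outputs for the same prompt.
candidates = {
    "honest": "The model rewrote the sentence, and it still reads well.",
    "dashy": f"The model rewrote the sentence {EM_DASH} and it still reads well.",
    "swap": "The model rewrote the sentence -- and it still reads well.",
    "empty": "",
}

scores = {name: em_dash_fitness(text) for name, text in candidates.items()}
# "swap" and "empty" score exactly as well as "honest", so an optimizer
# has no reason to prefer the output we actually wanted.
```

Patching the function to also penalize `--` or empty strings just moves the loophole somewhere else, which is why this takes people who know what they're doing.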

[–] 0_o7@lemmy.dbzer0.com -1 points 5 hours ago (1 children)

You have to understand that their public facing product is not the same as the one they allow enterprise or state actors to use.

They benefit from the public thinking they have these stupid limitations; it gives them more space to curate their product offerings where the real money is made.

[–] phonics@lemmy.world 13 points 12 hours ago (1 children)

Bro isn't even gonna check its output anyway.

[–] RoidingOldMan@lemmy.world 14 points 12 hours ago

It can't avoid doing those things. That's the reason for the article.

[–] DFX4509B@lemmy.wtf 51 points 17 hours ago (1 children)

Download an offline copy while you still can.

[–] umbrella@lemmy.ml 2 points 12 hours ago (1 children)

what are the best ways to do it while still having practical access?

[–] DFX4509B@lemmy.wtf 8 points 12 hours ago* (last edited 12 hours ago) (1 children)

Here's a link to the Kiwix library download for all of Wikipedia. It's 111GB though, so you'll need a lot of space and also a lot of time to wait for it to download.

Note, you'll also need Kiwix in some manner to read the zim file once it's downloaded.

Kiwix library - All of Wikipedia - direct download link

Kiwix app download page

But this'll let you have a local copy you can reference should actual Wikipedia ever get ruined by GenAI, or worse, get taken down by hostile governments.
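
If you want to check you have the room before kicking off the download, something like this works. It's a minimal sketch: the 10% headroom and the helper name are my own choices, and it reads the 111GB figure as decimal gigabytes.

```python
import shutil

# The 111GB figure quoted above, read as decimal gigabytes (an assumption).
ZIM_SIZE_BYTES = 111 * 10**9

def has_room_for(path, needed_bytes, headroom=0.10):
    """Return True if the filesystem holding `path` fits the download plus headroom."""
    free = shutil.disk_usage(path).free
    return free >= needed_bytes * (1 + headroom)

# Check the current directory; point this at wherever the zim file will live.
ok = has_room_for(".", ZIM_SIZE_BYTES)
print("Enough space for the zim file" if ok else "Free up some space first")
```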

[–] pedz@lemmy.ca 30 points 16 hours ago (5 children)

In French, one of the ways to spot AI writing is that sentences will often be missing articles or have bad grammar. Can this dude also ask the LLM to include more articles and make complete sentences in the language it's trying to imitate?

I was using the Discover feed on my phone but Google started to insert rewritten stories & headlines by AI and they were so annoyingly bad at making simple sentences in French that it made me stop using that thing.

[–] destructdisc@lemmy.world 18 points 15 hours ago

We'd rather the dude kill the LLM entirely. No one needs that shit

[–] Damage@feddit.it 5 points 13 hours ago (1 children)

Weird, in Italian they usually have impeccable grammar

[–] HotsauceHurricane@lemmy.world 12 points 13 hours ago

Jesus Christ what a wretched twit of a man.

[–] snoons@lemmy.ca 37 points 16 hours ago

Fuck you, Siqi Chen.

[–] Bloomcole@lemmy.world 0 points 6 hours ago (3 children)

Wikipedia is astroturfed BS for anything remotely politically related.
Useful if you want to learn about the Ivory-billed Woodpecker or a closed-cycle regenerative heat engine, etc.
So no politics and subjects with political implications such as history.

[–] gustofwind@lemmy.world 13 points 15 hours ago (5 children)

And now you know how and why so many programmers are just fucking awful and literally responsible for the hell we’re living in.

Kinda surprised they don’t get more hate. Programmers fucking suck.

[–] Jankatarch@lemmy.world 24 points 15 hours ago* (last edited 15 hours ago) (2 children)

Wow, such programmer.

Especially with that "investor" in his Twitter bio and all his posts about finance.

Hell, even if he were a programmer: Disney hires artists as well. Is the entire art community transphobic now?

(I am sorry if comment was meant to be satirical)

Edit : He is apparently a CEO too.

[–] MountingSuspicion@reddthat.com 9 points 15 hours ago

I was about to defend the lack of contributions and then I kept reading. I have a handful of different accounts I use and some have the same look about them, but yea the investor thing is an obvious tell.

[–] green_red_black@slrpnk.net 13 points 14 hours ago (1 children)

You do know programmers are behind the Fediverse, correct?

[–] gustofwind@lemmy.world 0 points 5 hours ago (1 children)

And some lawyers defend the innocent but the profession as a whole is rotten to the core

[–] Bazell@lemmy.zip 2 points 13 hours ago* (last edited 13 hours ago)

"Programmers" is a blanket term for everyone who does any sort of programming at all. The ones who rely heavily on AI and don't program well are called vibecoders, as far as I know.
