868
Reddit stock falls for second day as references to its content in ChatGPT responses plummet
(finance.yahoo.com)
This is a most excellent place for technology news and articles.
The fact that any AI company thought to train their LLM on the answers of Reddit users speaks to a fundamental misunderstanding of their own product (IMO)
LLMs aren't programmed to give you the correct answer. They're programmed to give you the most pervasive/popular answer on the assumption that most of the time that will also happen to be the right one.
So when you're getting your knowledge base from random jackasses on Reddit, where a good faith question like "What's the best way to get get gum out of my childs hair" get's two two good faith answers, and then a few dozen smart-ass answers that gets lots of replies and upvotes because they're funny. Guess which one your LLM is going to use.
People (and apparently even the creators themselves) think that an LLM is actually cognizent enough to be able to weed this out logically. But it can't. It's not an intelligence...it's a knowlege agreggator. And as with any aggregator, the same rule applies
garbage in, garbage out
Thats why I have stopped calling it ai. Its a dumbass buzzword just like cloud, that tech bros like to use but cant explain (or blockchain).
Its llms, and image generators/OCR (which has been around for decades), Using complex markov chains and a fuck ton of graphics cards. NOT AI. NOT AI.