Technology

86580 readers

3579 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 3 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

677

Anthropic/OpenAI may be spending more than $1000 for every $100 you pay them (ea.rna.nl)

submitted 1 month ago by Trilogy3452@lemmy.world to c/technology@lemmy.world

176 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] pinball_wizard@lemmy.zip 1 points 1 month ago* (last edited 1 month ago)

So are we assuming here that LLMs won't become more efficient over time?

Mostly. Moore's law ran up against the physical limits of the materials we make chips out of - so desktops of today just do what the desktops of yesterday do, mostly.

We should keep seeing improvements in highly specialized models. There's interesting outcomes to have here, with the right setup and ollama.

but -

The really promising impressive models today are just running with long contexts on shithloads of hardware - which is neither coming to home PCs any time soon nor going to actually be profitable to run any time soon.

There's an argument to be made that running the really interesting model on a ton of hardware might make money for really specific uses - but then when we talk about specific uses that are worth lots of money, those use cases tend to tolerate difficult interfaces and reward accuracy. LLMs invariably reduce accuracy in exchange for ease of use. There might be a sweet spot for a huge expensive hallucination prone LLM in some of these uses, but I doubt it (the entire approach) competes, long term.

There's a few specific use cases where inaccuracy is desirable - largely forms of shifting accountability and some kinds of gambling. Things that either are or should be crimes have a higher tolerance for AI hallucination.

But - a small cheap local model has all the desirable attributes for doing these things (crimes) poorly as a big expensive model. So there's probably not even much money to be made there.

I expect that this tech is not going away, but it's also not earning back the current investment.