this post was submitted on 07 Jun 2026
620 points (98.7% liked)

Technology

85213 readers
4095 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 3 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] Burninator05@lemmy.world 1 points 1 hour ago

I wish that was inversely proportional. The less I pay, somehow it costs them more money.

[–] nullspace@lemmy.world 6 points 2 hours ago* (last edited 2 hours ago) (2 children)

I can't imagine paying for AI when the open source tools have made it so easy to set up a model locally.

[–] potustheplant@feddit.nl 7 points 2 hours ago

Don't be daft. The vast majority of people don't have the knowledge or resources to set that up locally.

[–] ranzispa@mander.xyz 0 points 2 hours ago

Easy to set up, but still needs a 15k $ graphics card and electricity bill. The price you pay openai/anthropic is much cheaper than that for that quality of model.

Sure, you can setup a small model on a consumer graphics card, but the output will be considerably worse and the processing speed considerably lower.

For 240€/year you got a subscription to anthropic which will happily ingest a whole repository and process it in about one minute. No matter what latest model GPU you installed on your computer, you won't be able to do that.

Sure, this guy was able to run a 26B model on an old CPU: https://point.free/blog/gemma-4-on-a-2016-xeon/

But that was not easy at all and the speed you get is definitely not the same as the one provided for a very cheap price.

[–] BigDanishGuy@sh.itjust.works 8 points 3 hours ago (1 children)

Nice try, I ain't gonna pay anyway

[–] Scrollone@feddit.it 0 points 3 hours ago

Exactly, I just keep using the free plan and when I finish the amount for the day I just switch to another service

[–] LovableSidekick@lemmy.world 4 points 2 hours ago (1 children)

I wonder how much they spend for every $0 I pay them.

[–] M0oP0o@mander.xyz 2 points 2 hours ago

$1000 I would guess. They are just burning money at this point.

[–] mfed1122@discuss.tchncs.de 18 points 6 hours ago (1 children)

I mean, this is no different than Walmart making prices low until other businesses die out and then raising them.

It is no different than police shoving all the homeless people and drug addicts into one area of town to crash the property prices, and then evicting them once developers buy everything for cheap.

They're purposely operating at a loss in the expectation that they can get ingrained into a ton of workflows, and then gouge everyone absolutely to death while also worsening the quality of the service to make it cheaper for them to run.

If it weren't so horrible for the environment, I'd kind of like it, because all the dumbass executives that are signing up for this are going to get exactly what they deserve. You'd think they'd recognize a scheme when they see one.

[–] fishy@lemmy.today 13 points 6 hours ago (1 children)

My CEO (whom I don't consider a particularly good or bad CEO) spent a day playing with AI then when asked if he'd sign the company up with the service he literally laughed in their faces and said it's useless. I was honestly shocked because he's totally into buzzword and popular crap. Gained a lot of respect for him that day.

[–] WorldsDumbestMan@lemmy.today 1 points 4 hours ago (1 children)

An older co-worker seems to ask AI for help during work, we are blue collar. But the Owner of the company does not seem to use it whatsoever.

I ask Claude on occasion, to see if it will say something smart (it was mostly useless as fuck).

[–] Scrollone@feddit.it 2 points 3 hours ago (1 children)

Honestly I think Claude it's good at programming. Way better than ChatGPT.

But I ain't going to pay for it.

[–] ranzispa@mander.xyz 2 points 2 hours ago

Published a library doing some very specific data processing. One of the algorithms I implemented was a bit too slow: it would take about a week to process data. I reckon implementation was a little bit sloppy, but I've been implementing a bunch of algorithms from research papers and this was pretty much the published implementation.

I asked Claude to analyse the implementation and check whether it could be improved, half an hour later I got a 26,000% improvement in performance with exactly the same results passing all tests.

Of course, I could have done that myself. But optimization had to go down to simd level; I doubt I would have been able to do that in less than a week of work.

[–] AdolfSchmitler@lemmy.world 13 points 6 hours ago

Trust me bro we're so close to profitability bro, just need this IPO to secure funding one last time bro then we'll be profitable bro I swear.

[–] ReginaPhalange@lemmy.world 4 points 5 hours ago

Oh come on bubble, why won't you crash already?

[–] ParlimentOfDoom@piefed.zip 16 points 10 hours ago* (last edited 10 hours ago)

Now, I'm no MBA, but that seems like a bad business plan...

[–] Cornpop@lemmy.world 17 points 11 hours ago (3 children)

What is the actual “cost” after they buy the hardware, is that $1000 really pure power usage cost?

[–] Corkyskog@sh.itjust.works 16 points 11 hours ago (1 children)

The problem is that the hardware has a 5 or 6 year depreciation schedule on paper, but NVIDIA keeps saying that their next generation chip will be twice as good as their last chip so there is a FOMO schedule of like every two years.

[–] nullspace@lemmy.world 2 points 2 hours ago

Would be nice to see that used hardware for sale rather than it being junked as a writeoff.

[–] rumba@lemmy.zip 4 points 11 hours ago

that's the $84,000 question. They're filling datacenters with the fastest possible equipment and need it to be 10x faster, That hardware is dinosaur fodder a year after they install it.

[–] HereIAm@lemmy.world 4 points 11 hours ago (2 children)

I'm curious as well. My knowledge is probably quite outdated, but from what I understood the training part is what's expensive and then querying the model is pretty cheap. Is it still true (or was it ever) that the generated answers on search engines are cheaper to generate than the actual search results?

[–] CheeseNoodle@lemmy.world 11 points 11 hours ago (2 children)

I find that hard to believe, I recently had to uninstall co-pilot after it weaseled its way into my search bar. Its not an exageration to say that my PC literally ran cyberpunk 2077 with pathtracting better than it ran the fucking windows search bar with co-pilot.

[–] NikkiDimes@lemmy.world 2 points 6 hours ago* (last edited 6 hours ago)

That's just a shitty front end interface implementation, it has nothing to do with the actual inference run by the models.

[–] SorryQuick@lemmy.ca 2 points 7 hours ago

Look at the public numbers, it seems true. Copilot on your taskbar is just windows being garbage, not the AI being bad. Just look at self-hosted AI and measure the power costs of your queries. It’s tiny.

[–] Shteou@lemmy.world 2 points 10 hours ago

It is sorta. Training is orders of magnitudes more intensive than inference, but we infer billions of times within a model generation.

[–] Fizz@lemmy.nz 8 points 11 hours ago

The author is right and wrong. Its subsidised but not by anthropic. The power users who use their plans to the limit are subsidised by the rest of the users. Im an AI hater but I do think anthropic will be profitable next year. Their revenue growth is insane and looks to just be getting started. Claude code took enterprise by storm and now cowork is out.

[–] Mwa@thelemmy.club 4 points 10 hours ago (1 children)

no wonder why OpenAI is losing alot of money.

[–] Scrollone@feddit.it 1 points 3 hours ago

They'll never earn

[–] Fubarberry@sopuli.xyz 37 points 19 hours ago (9 children)

Honestly Google is likely to beat openAI and Anthropic as things are.

OpenAI and Anthropic have to buy/rent their hardware from Nvidia, while Google is making their own TPU hardware. Google's hardware costs on AI is way lower, every dollar they spend on it goes a lot farther.

And unlike the other two, they're already a profitable company. They're making record profits right now. They don't have a desperate need to figure out how to make back billions on their AI models, they can just keep offering Gemini at a comparatively cheap price and wait for anthropic and open AI to bankrupt themselves.

[–] ryper@lemmy.ca 5 points 6 hours ago (2 children)
[–] Fubarberry@sopuli.xyz 2 points 3 hours ago* (last edited 3 hours ago)

That's definitely costing them more than running it on their own hardware, but it doesn't mean AI is costing them more than the AI startups. Anthropic for example is already paying SpaceX 1.25 Billion a month for compute, and has agreed to pay Google 200Billion oflcer the next 5 years for access to Google's compute and TPU chips.

Google's deal with xAI specifically lets them terminate the deal with 90 days notice after the end of the year. Google is also investing heavily in building new data centers with their hardware. I'm assuming this deal means they've eclipsed their current TPU capacity, and are just looking for a short term bandaid until they can catch up with their new constructions.

[–] NikkiDimes@lemmy.world 1 points 4 hours ago* (last edited 4 hours ago)

Anthropic is doing the same too. SpaceX over here providing the shovels and pans for the modern day gold rush, sheesh.

[–] zbyte64@awful.systems 2 points 7 hours ago (1 children)

I guess google's announcement of renting xai compute could have been simply for show to boost SpaceX ipo.

[–] Fubarberry@sopuli.xyz 1 points 6 hours ago

They have big plans to build more data centers for themselves, so they definitely want more compute than the have access to right now. But even if they're paying more to rent xai compute, they're still paying less overall for hardware/access than their direct AI competition.

[–] jj4211@lemmy.world 4 points 11 hours ago (1 children)

Plus they have a hook with the common folk, the phone steers you toward Gemini (Android phones, obviously, and Apple currently partners with Google for Gemini for iPhone...).

For Claude and OpenAI, you have to explicitly want to go out of your way to use them, or use them indirectly through another service that has a hook.

Claude seems to have some software developers explicitly preferring them, though a alot of the corporate money is on Microsoft and Microsoft leveraged Visual Studio and Github to become the business-friendly frontend, and sure, you can use Anthropic models too... Though Microsoft ultimately has control of what is reasonably available and how much each one costs. Anthropic has a shot but I could see Microsoft pivot to really mess with Anthropic. The one gap in Microsoft strategy is the "native AI" workflow where Claude Code has won hearts and minds, but it uses massively more tokens for frankly marginal or sometimes negative value compared to a more curated use in-editor.

OpenAI I see as the most exposed. Lot's of data showing they are suffering from people being over the fad of going out of their way to use ChatGPT, especially since their phones have started embracing 'default' Chatbot. Software developers that are inclined to use LLM are also inclined to be pretty dismissive of anything other than either Anthropic or open weight models, depending on their inclination. Also Altman seemed the most agressive in committing to spending money they didn't have, though all of them exhibit this to some extent.

I predict Microsoft ultimately pivots to in-house models and convinces the businesses to go that way. Apple may continue with Gemini or roll their own eventually. Anthropic currently has the stronger position between OpenAI and them, but I think you are right that both have risk of just being left behind.

Claude just kills the other models, it's not close. Microsoft could ban claude extension from VSC tonight and ill start using command line Claude tomorrow. There's just no comparison right now. Itd be like Microsoft trying to ban NVidia gpus from Windows, they'll just lose.

load more comments (6 replies)
[–] PattyMcB@lemmy.world 22 points 17 hours ago (2 children)

Good thing I don't personally pay them anything

load more comments (2 replies)
load more comments
view more: next ›