this post was submitted on 24 May 2026
487 points (96.0% liked)

Technology

84923 readers
4422 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
(page 3) 46 comments
sorted by: hot top controversial new old
[–] yesman@lemmy.world 11 points 1 day ago (6 children)

I'm unfamiliar with AI chatbots that you pay for. What is a token?

In very simple terms, a token is more or less a word. You pay per input and output tokens (your prompts and the answers) as they correlate the most closely to the energy expended by the LLM to process your request.

load more comments (5 replies)
[–] BlackLaZoR@lemmy.world 2 points 1 day ago

No wonder. Since deepseek has open license, they have to compete with 3rd party providers, and in case of smallest models with local generation.

[–] Gerudo@lemmy.zip 7 points 1 day ago

Damn, fire sale already?

[–] byte_0verflow@lemmy.ml 8 points 1 day ago

Thank you daddy Xi

[–] Mwa@thelemmy.club 5 points 1 day ago* (last edited 1 day ago) (1 children)

Still gonna self host it instead (maybe)

[–] qaz@lemmy.world 3 points 1 day ago (2 children)

FYI the flash model is ~158 GB

[–] Tja@programming.dev 2 points 1 day ago (3 children)

How are they running it? Doesn't the model have to fit in (V)RAM? Does Nvidia have such huge memories in the H cards?

Lots of GPUs together.

[–] boonhet@sopuli.xyz 1 points 1 day ago* (last edited 1 day ago)

For self hosting it essentially needs to fit in VRAM + RAM but it'll take a lot of CPU for the part in RAM

Deepseek probably uses those big fancy H cards and not one but several together to increase VRAM.

load more comments (1 replies)
[–] Mwa@thelemmy.club 1 points 1 day ago

The destiled models?

[–] RickyRigatoni@piefed.zip 1 points 1 day ago (2 children)

How fast do you burn through tokens that $4 for a million of them was a lot of money?

[–] eager_eagle@lemmy.world 5 points 1 day ago

If you use it for Q&A, that's a lot of tokens. If you use it to write software somewhat autonomously, it's easy to go through a million tokens every few hours. Do that every day and you'll be paying over $100 a month at that rate.

load more comments (1 replies)
[–] Fizz@lemmy.nz 1 points 1 day ago (2 children)

They have to cut the price because its behind the frontier models. No one would buy it otherwise

[–] eager_eagle@lemmy.world 4 points 1 day ago* (last edited 22 hours ago)

The ones paying attention and on a budget would still use them. "The best" of anything is usually not cost effective.

Even before reducing the prices, they were already 2 to 3 times cheaper than equivalent alternatives from Anthropic's ($3in, $15out) and OpenAI's ($1.75in, $14out) at $1.74in and $3.48out. Now they're around 10x cheaper.

Edit: Deepseek V4 Flash is the leading model on OpenRouter

so much for no one buying it

[–] badgermurphy@lemmy.world 1 points 1 day ago (1 children)

Also, the big models are in the infant stages of turning the monetization screws. This could be a tactic to knock their owners' legs out from under them. If they can't turn up their prices as much or as fast as they had hoped due to the competitive pressure, venture capital may begin to divest sooner. At their current debt levels, that would literally end OpenAI and I believe also Anthropic.

If a temporary "permanent" price cut wipes out some competitors, that will make for smoother sailing in the future for them to raise prices more than they otherwise would have been able to, quickly recouping the cost of the price reduction and then all profit after that.

[–] Fizz@lemmy.nz 2 points 1 day ago (1 children)

If it knocks out small competitiors then the big players will buy them up for the compute alone.

[–] badgermurphy@lemmy.world 1 points 23 hours ago (1 children)

I'm not aware of any noteworthy small competitors in the space. Who do you mean?

load more comments (1 replies)
[–] desmosthenes@lemmy.world -1 points 1 day ago* (last edited 1 day ago)
load more comments
view more: ‹ prev next ›