FRESH

Hacker News

Home

The unbearable cheapness of open weight models

161 points by ddxv

by Tuna-Fish

3 subcomments

> What worries me about this is that Anthropic and OpenAI seem to have backed themselves into a corner of high costs. Can they reasonably decrease their prices by 20-50x to compete with DeepSeek or Xiaomi’s Mimo?
They have high prices, not high costs. They will obviously keep prices as high as they can for as long as they can, while keeping demand up. Once demand starts to fall, so will the prices.
> Are these models cheap because they are open weight and having hundreds or people stress test running them on different hardware helped to lower the cost? Or is it that they are being provided as loss leaders to drive the prices down?
Neither. They are cheap because they have neither technical edge nor brand power to keep the prices high, and so have to ask commodity prices for them.
People somehow still don't get it, despite everyone who studies the economics of it telling them: Inference is dirt cheap. Training is expensive, inference is cheap, and getting cheaper.

by Jackobrien

8 subcomments

The giants knew this was coming, and soon 95% of AI tasks will be able to be done by open models (coding, research, cowork style work). So why pay a premium? Why use them at all? This leaves the labs with two options:
1) push the frontier in a way only massive scale can, and cash in on it (mythos level cyber security, recursive training, frontier science work). There’s big money for never before possible capabilities.
2) own the app layer with their edge in reputation and powered by their infrastructure. Be apple where everyone else is Linux. Do design, coding, research, SMBs, legal, finance, healthcare and more (they are doing all of this).
Will it be enough to justify a Google level valuation? We’ll see how fast they can push it.

by arthurofbabylon

2 subcomments

Let's imagine that Anthropic/OpenAI fail to manufacture scarcity by villainizing Open Weight models (a sincere probability). What is left for these corporations to prop up their prices, or any margin at all? I expect scaffolding around tool use, supporting bespoke implementation and driving risk down for institutional adoption. (They might even build an insurance tool to protect accountants/lawyers from errors in compounded probabilism!)
A question for economists... It seems plainly clear to me that information and information processing is commodifying (for the first time in human history?). Without the age-old bottlenecks at the top of the value chain, capital will surely flow downwards, right?

by linzhangrun

3 subcomments

It would not be surprising if GPT and Claude get cheaper too as inference gets cheaper. Two years ago, o1 was the strongest model and cost much more than Fable, while being nowhere near as smart as a Qwen 3.6 35B that you can now run on a DGX Spark without much trouble.

by beepdyboop

3 subcomments

I don’t get it. So many here are saying open weight models will kill the frontier labs. But open source and similar have tried to beat private companies everywhere all the time, and people still buy the best products even if great open source alternatives are available. Why wouldn’t this be the case for AI too?

by arikrahman

3 subcomments

With cache hit rates being effectively free, harnesses like Reasonix have let me do a month of work for less than 2 dollars. It's not even the subsidies making it cheap, American providers like Digital Ocean or Cloudflare host the same model with similar pricing.

by drillsteps5

0 subcomment

Open weights models are cheap in the context of the article (when you run inference in the cloud) because they are free. When I pay for inference for running DeepSeek open weights model I only pay the inference service provider for compute/memory/storage/network throughput. The model itself is free, the developer isn't getting a dime.
Developing these things is NOT free, there's a lot of labor, hardware, compute/memory/storage/network that goes into that. Who's paying for all this? Chinese govt? Developers themselves? What's the revenue model here?
I absolutely LOVE ability to either run them locally or access inference providers on the cheap, but having a hard time understanding the financial side of this.

by odie5533

1 subcomments

This is what concerns me about how AI giants are planning to make money. Their product has already been commoditized at prices which for them are still subsidized to grab market share. Unless the giants invent a technological leap, their prices are going to be dragged down by open weight models and I don't see how they'll turn a profit.

by Zak

2 subcomments

One issue I keep seeing with cost comparisons is that they compare API rates while a substantial fraction of users are on subscription plans.
It's more expensive to use GLM 5.2 paying z.ai or Opencode Zen API rates than it is to use Opus on a subscription plan. Both of those providers offer subscriptions priced favorably relative to their API rates, but only in what are effectively trial sizes.

by anax32

2 subcomments

Open weight and local hosting is far, far cheaper. In every respect. Even support is cheaper, over time.
However, it's difficult to sell this to businesses who want contracts and KPIs, not staff and commitments.
Regulated industries will favour the closed sources, either by choice or mandate. The interesting question is whether they will have better models, or worse models. History says they will receive a worse service, but continue anyway.

by MoonWalk

3 subcomments

I'd appreciate an explanation of what "open weight model" means. Is it a "weight model" that is open, or a model with open weights (so should be "open-weight model"), or is it weights that can be applied to a model?
Are weights separable from a model? And if not, what is the point of saying "open-weight model" instead of just "open model?"
To the newcomer, it's hard to determine what the components of an AI system are from the throwing-around of these terms.

by bmnbmnbmn

1 subcomments

One of the purposes of open weight models is to create a moat. If there were no open models available, I think we'd see much more and better models coming from Europe by now. Right now, any startup wanting to build and sell a model needs to be substantially better than the open models, which has become increasingly difficult and expensive.

by my-next-account

1 subcomments

I wonder whether Oracle is going to go bankrupt because of this

by leroman

0 subcomment

The token-economics for closed source models are different, they are optimizing for 200 USD tokens worth of software engineer monthly usage, they will increase per token price as models or harnesses are more optimized.

by dwaite

1 subcomments

Where's a few good places to go to learn more about open weight models, both running hosted and running locally?

by CuriouslyC

0 subcomment

The government is going to ban foreign models and foreign inference providers, without question. The US govt is going to dig its dirty little fingers into OAI/Anthropic/Oracle/(probably)SpaceX and end up taking some stock for a sovereign wealth fund (probably timed to prop up flagging share prices, and with the promise of sweet government grift down the line), and at that point the bans will be framed as protecting that investment.

by surgical_fire

0 subcomment

One thing it doesn't even mention is how good those models are. Evet since I moved to DeepSeek I had zero regrets. It performs exceptionally well. I honestly prefer it to ChatGPT (or Claude that I use at work).
I never used Fable, maybe it is that much better. DeepSeek has no problems with the workloads I give it though - if it only keeps marginally improving with each interaction I don't see myself needing to come back.

by dist-epoch

0 subcomment

It's so refreshing to read a short to the point article, which is not extruded into 10 pages with LLMs.

by juancn

0 subcomment

90% of my model use is on local open-weights models.
The things that I need to automate do not need frontier models. Heck, even a gemma-4-12B-it-qat-UD-Q4_K_XL can deal with a lot of complexity if properly guided (it can run on 16GB of unified memory, for example on a base model Macbook Air).
I've been using it to translate Javascript to a custom scripting language in a product I work for, just by providing a system prompt and an MCP tool to call the target compiler to check for errors.
Sometimes it converges faster than Opus 4.6 (I've tried) because it doesn't over-think stuff.
If it were a person I would say it knows less, but it's still smart.
I mean, you don't need the most powerful tool at all times. We treat AI as one-size-fits-all, and once cost gets in the way, it will matter.

by isoprophlex

4 subcomments

Aren't these open models so cheap because they're (partially) chinese gov. sponsored, and because they're stealing and redistributing the IP that comes in?

by tuatoru

0 subcomment

Deepseek's price looks unsustainable. Ant have said their operating margin is 70%. A leaner company could maybe raise that to 90%.
Most of the cost of supplying inference compute is depreciation of the GPUs. Maybe Deepseek is anticipating a 50 year life for theirs.

by snootypoot

0 subcomment

i agree with his statement that the big companies and the string pullers in government are inching toward banning open models.allowing the plebs unrestricted access to things seems against the wishes of the "you will own nothing and be happy" / "you will rent everything on the cloud and subscribe to your appliances" crowd such as blackrock and so on.
anyone who disagrees is not seeing the forest, only the trees.

by kittikitti

0 subcomment

Even if open weight models were vastly more expensive, I would still prefer them. I don't know where my data is going and whether they're lying about the model when I make an API call. They can ban you from their API for any reason. Anthropic recently pulled their frontier models. There are numerous compliance concerns. The list goes on and on.

by cws_ai_buddy

0 subcomment

[flagged]

by nsoonhui

0 subcomment

[flagged]

0 subcomment