Mimo cost ~$400 at the old price, so about $40 today. Opus cost ~$5000
That's over 100x cheaper, and just 3 points behind.
I can't wait to experiment with an llm consortium of 100 deepseek and mimo models. Crazy times.
Shut up and take my m̶o̶n̶e̶y̶ data!
Edit: Gemini on google search told me I could write strikethrough text on hn using <s>. Mimo told me it was unsupported and then went on to list some tags that are supported, like <b>bold</b>. I tried copy pasting the word in strikethrough from a word processor but it lost the format. I ended up using mimo in an agent shell wrapper to produce it, and copy pasting from the terminal worked for some reason.
PS: Have not tried this but Deepseev4 Flash (not even Deepseekv4 Pro version) with set to "high" has pretty much Claud Opus 4.7 level of capabilities and is lightening fast and dirty cheap. Hours and hours of conversation barely costs few cents.
It's possible they've finally integrated cheap(er) chinese chips. It's also possible they're just subsidising inference for real-world usage data. Interesting either way.
From their docs "After using 10M input (cache miss) tokens of MiMo-V2.5-Pro, it is equivalent to consuming 3000M Credits, and you can still enjoy 1100M Credits of MiMo-V2.5". So it's around 12M input credit vs Earlier 60M tokens.
My plan was just upgraded to 38 BILLION tokens per month. That's at least 10X the tokens I've used in my entire agentic development so far.
I should probably downgrade my plan, but we'll see. :)
It's funny thinking the US companies are hiking prices and Chinese ones do the opposite, it's obviously an strategy, but pretty funny
Chinese models incidentally slurps up some terms that lead them to finding unflattering words that you wrote about the CCP in a random journal entry, or maybe a social media csv export. You go to China one day and are denied entry due to what you said.
Realistic or no? (yes i know the us is getting bad in re. to what you write online as well)
Models hosted in China are a siren call that I don't feel bad about resisting.
The question is how they are managing to do so? They are supposed to struggle due to chip sanctions.
Secondly, why now? The US companies were supposed to subsidize too but now they are unable to keep up. Everyone going to usage based pricing, so it's unsustainable for them. They are well funded too.
If there are genuine hardware breakthrough reducing compute needs then that is good for the whole world I believe.
as someone who now lives & has lived in the west for the majority of their adult life - yeah the US western models r fucked n the crazy valuations of the A.I labs - which also filters down to the economy - since all money instead of being put to productive use is being wasted on this shit. hell electricity bills are up - cz datacenters need power. the current crooks in power don't believe in clean energy.
This is why Anthropic wants these chinese AI models banned as they are in the lead in the AI race to zero and they know that there is no modal moat.
So don't tell Dario.