Open-weight models with strong multilingual support change the math, because once you have GPU capacity you can self-host at marginal cost. DeepSeek's earlier versions already punched above their weight on non-English benchmarks (especially CJK and some Indic languages, where the gap to GPT-4 was much narrower than English-only benchmarks suggested).
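To make the self-hosting point concrete, here's a minimal serving sketch with vLLM. The checkpoint id and tensor-parallel degree are assumptions (I don't know what V4 ships as, and the parallelism has to match your hardware), so treat it as a shape, not a recipe:

```python
# Minimal self-hosting sketch with vLLM. Model id and parallelism degree
# are placeholders -- swap in the actual checkpoint and your GPU count.
from vllm import LLM, SamplingParams

llm = LLM(
    model="deepseek-ai/DeepSeek-V3",  # placeholder; adjust for V4
    tensor_parallel_size=8,           # assumes an 8-GPU node
    trust_remote_code=True,
)

params = SamplingParams(temperature=0.6, max_tokens=128)
out = llm.generate(["Explain Turkish agglutination in one sentence."], params)
print(out[0].outputs[0].text)
```

Once that's up, per-request cost is basically electricity plus amortized hardware, which is the whole "marginal cost" argument.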
Two questions for anyone who's actually deployed V4 in production:
1. How does it handle Turkish/Slavic morphology compared to V3? In our tests V3 was solid for Russian and respectable for Turkish, but it handled compound morphology in agglutinative languages a bit awkwardly (a quick tokenizer probe, sketched after this list, is one cheap way to sanity-check this).
2. Is the long-context window actually usable end-to-end, or does quality degrade past ~64k tokens like with most open models? (The second sketch below is the kind of needle-in-a-haystack probe we'd run.)
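On question 1, one cheap proxy (not a substitute for a real eval) is how badly the tokenizer fragments long agglutinative forms; heavy fragmentation tends to correlate with the awkward compound handling we saw. A sketch, assuming the V3 tokenizer id since I don't know V4's:

```python
# Tokenizer-fragmentation probe for agglutinative morphology.
# Checkpoint id is an assumption; any DeepSeek tokenizer works the same way.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("deepseek-ai/DeepSeek-V3", trust_remote_code=True)

words = [
    "evlerinizden",                           # Turkish: "from your houses"
    "çekoslovakyalılaştıramadıklarımızdan",   # famously long Turkish compound
    "достопримечательностями",                # Russian: "landmarks" (instrumental)
]
for w in words:
    pieces = tok.tokenize(w)
    print(f"{w} -> {len(pieces)} pieces: {pieces}")
```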
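And on question 2, the degradation is easy to probe yourself with a needle-in-a-haystack sweep against whatever endpoint you're serving (vLLM exposes an OpenAI-compatible one). The URL, model id, and filler-token arithmetic below are assumptions; the ~12-tokens-per-sentence estimate is eyeballed, not exact:

```python
# Needle-in-a-haystack probe: bury one fact at varying depths in ~60k
# tokens of filler and see whether retrieval survives near the ~64k mark.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")  # assumed local server

NEEDLE = "The vault code is 4417."
FILLER = "The harbor was quiet and the ferries ran on time. "  # ~12 tokens each

def probe(total_sentences: int, depth: float) -> str:
    """Place NEEDLE at a relative depth inside filler and ask for it back."""
    pos = int(total_sentences * depth)
    doc = FILLER * pos + NEEDLE + " " + FILLER * (total_sentences - pos)
    resp = client.chat.completions.create(
        model="deepseek-ai/DeepSeek-V3",  # placeholder model id
        messages=[{"role": "user", "content": doc + "\n\nWhat is the vault code?"}],
        max_tokens=16,
    )
    return resp.choices[0].message.content

for depth in (0.1, 0.5, 0.9):  # 5000 sentences of filler ≈ 60k tokens
    print(depth, probe(total_sentences=5000, depth=depth))
```

If answers stay correct at all depths, the window is at least retrievable end-to-end; whether reasoning quality holds up at that length is a separate, harder test.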
That said, I really hope this DeepSeek model performs on par with Claude Sonnet.