FRESH

Hacker News

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

260 points by victorbuilds

by victorbuilds

2 subcomments

Notable: they open-sourced the weights under Apache 2.0, unlike OpenAI and DeepMind whose IMO gold models are still proprietary.

by yorwba

1 subcomments

Previous discussion: https://news.ycombinator.com/item?id=46072786 218 points 3 days ago, 48 comments

by ilmj8426

2 subcomments

It's impressive to see how fast open-weights models are catching up in specialized domains like math and reasoning. I'm curious if anyone has tested this model for complex logic tasks in coding? Sometimes strong math performance correlates well with debugging or algorithm generation.

by WhitneyLand

0 subcomment

Shouldn’t there be a lot of skepticism here?
All the problems they claim to have solved are on are the Internet and they explicitly say they crawled them. They do not mention doing any benchmark decontamination or excluding 2024/2025 competition problems from training.
IIRC correctly OpenAI/Google did not have access to the 2025 problems before testing their experimental math models.

by terespuwash

1 subcomments

by simianwords

2 subcomments

A bit important that this model is not general purpose whereas the ones Google and OpenAI used were general purpose.

by H8crilA

3 subcomments

How do you run this kind of a model at home? On a CPU on a machine that has about 1TB of RAM?

by letmetweakit

0 subcomment

by sschueller

6 subcomments

How is OpenAI going to be able to serve ads in chatgpt without everyone immediately jumping ship to another model?

by LZ_Khan

0 subcomment

by OBELISK_ASI

0 subcomment

by Jeff-Collins

0 subcomment

by Scott-David

0 subcomment

by YouAreWRONGtoo

0 subcomment