He could mean a few other things. One would be that he has like 20 version tags and merges 15 features into each version (does he explicitly say merge to main?).
The other thing he could mean is that he has a software engineering agent that loops through GitHub issues and his personal notepad, and maybe uses a few different branches to test things out and build, I dunno, adversarially, running all sorts of experiments.
Which would be genuinely cool! But using MRs to, say, tweak a bunch of variables back and forth isn’t what 300 MRs implies.
But then the ultimate question is: it may be cool to fully automate a software engineering agent, and it certainly is the type of research Anthropic and someone of Boris’ stature (and pay level) should be working on. But is it efficient?
I guess yes, he's talked about this:
https://karozieminski.substack.com/p/boris-cherny-claude-cod...
A lot of you don't want to hear it but this is a user issue.
Their apparent inability to get the basics right makes me severely doubt their claims of self-improving AI. The humans at Anthropic wouldn't know improvement if it landed on their lap and started twerking, and AI cannot do a job without strong human intervention into what the goals and guardrails actually are.
I'm kind of reminded of when Microsoft claimed it took a team of Ph.D.s to write a terminal application that updated at 60fps, and then Casey Muratori did it over a weekend. And this was before AI was writing code in earnest; when LLM-induced brainrot really sets in, civilization is in for a world of fresh hurt: lots more generated code, almost all of it garbage. And the promised AI crossover point where it becomes AGI, or indistinguishable from it for software design purposes, recedes into the infinite future.
Everyone else needs to start treating them that way, or you're going to regret it once you realize what's actually happening.