Then, I only switched from gpt-4 to gpt-5 because the price was cheaper lolz
Opus is also not the worst at hacking things either. Sometimes it hacks things 'by accident' you see. If Mythos is better at it, then at some point, yeah, I can see how that might start to become a problem. Especially running unsupervised.
So?
Modern software is designed with a defense in depth model, so it often requires chaining multiple vulnerabilities to get a successful exploit. But individual vulnerabilities still need finding and fixing because people might find vulnerabilities in the other isolation layers later.
I swear every time an LLM does something useful, the usual band of skeptics bends over backwards trying to invent reasons to dismiss it.
No reason to expect capabilities of models are going to stop.
In my view the naysayers always simply been moving the goalposts, and never admit when they were wrong. "AI just produces slop" -> "AI can't write useful code" -> "AI can't take SWE jobs" -> [we are here]
https://news.ycombinator.com/item?id=47717587
And here we learn that Mythos is not a big deal. Are there people who believe both?
One day I hear it is all a marketing pitch, another day I hear it can literally end earth so it should be regulated.
How do I reconcile this?