We were asked to try to persuade it to help us hack into a mock printer / dodgy Linux box.
It helped a little, but not all that much.
But in terms of coordination, I can't see how it would be useful.
The same goes for Claude: your API access is tied to a bank account, and vibe coding a command and control system on a very public platform seems like a bad choice.
Edited November 14 2025:
Added an additional hyperlink to the full report in the initial section
Corrected an error about the speed of the attack: not "thousands of requests per second" but "thousands of requests, often multiple per second"
This was discussed in some detail in the recently published Attacker Moves Second paper*. ML researchers like using Attack Success Rate (ASR) as a metric for model resistance to attack, while for infosec, any successful attack (ASR > 0) is considered significant. ML researchers generally use a static set of tests, while infosec researchers assume an adaptive, resourceful attacker.
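A toy way to see the gap between the two mindsets (the numbers here are made up and have nothing to do with the paper's data):

    # ML-style evaluation: Attack Success Rate over a fixed, static prompt set
    results = [False, False, True, False, False]   # did each canned attack succeed?
    asr = sum(results) / len(results)
    print(f"ASR = {asr:.0%}")                      # 20% might read as "fairly robust"

    # Infosec framing: an adaptive attacker only needs one success
    print("breached" if any(results) else "held")  # any single True means the defence failed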
Not sure if the author has tried any other AI assistants for coding. People who haven't tried a coding AI assistant underestimate its capabilities (though unfortunately, those who use them often overestimate what they can do too). Having used Claude for some time, I find the report's assertions quite plausible.
This could be a corporate move as some people claim, but I wonder if the cause is simply that their talent is currently focused elsewhere and they don't have the company structure in place to deliver properly on this.
(If that is the case they are not then free of blame, it’s just a different conversation)
Sort of like firearm ads that show scary bad guys with scary looking weapons.
They're an AI research company that detected misuse of their own product. This is like "Microsoft detected people using Excel macros for malware delivery" not "Mandiant publishes APT28 threat intelligence". They aren't trying to help SOCs detect this specific campaign. It's warning an entire industry about a new attack modality.
What would the IoCs even be? "Malicious Claude Code API keys"?
The intended audience is more like AI safety researchers, policy makers, other AI companies, and the broader security community trying to understand capability shifts.
It seems the author pattern-matched "threat intelligence report" and was bothered that it didn't fit their narrow template.
Nah that can't be possible it's so uncharacteristic..
I agree so much with this. And am so sick of AI labs, who genuinely do have access to some really great engineers, putting stuff out that just doesn't pass the smell test. GPT-5's system card was pathetic. Big talk of Microsoft doing red-teaming in ill-specified ways, entirely unreproducible. All the labs are "pro-research", but they again and again release whitepapers and pump headlines without producing the code and data alongside their claims. This just feeds the shill cycle of journalists doing 'research', finding 'shocking thing AI told me today', and somehow being immune to the normal burden of proof.
Someone make this make sense.
Honestly their political homelessness will likely continue for a very long time; pro-business Democrats in NY are losing traction, and if Newsom wins in 2028 they are still at a disadvantage against OpenAI, which promised to stay in California.
They must be new to the Internet :)
More seriously, I would certainly like to see better evidence, but I also doubt that Anthropic is making it up. The evidence for that seems to be mostly vibes.
If we don’t trust the report and discard it as gossip, then I guess we just wait and see what the future brings?
Claude for Cybersecurity - Automated Defence in Depth Hacker Protection
They'll do stuff like prompt an AI to generate text about bombs, and then say "AI decides completely by itself to become a suicide bomber in shock evil twist to AI behaviour - that's why you need a trusted AI partner like anthropic"
Like come on guys, it's the same generic slop that everyone else generates. Your company doesn't do anything.
For context: a bunch of whitehat teams have been using agents to automate both red and blue team cat-and-mouse flows, and quite well, for a while now. The attack sounded like normal pre-AI methods orchestrated by AI, which is what many commercial red team services already do. Ex: Xbow is #1 on HackerOne bug bounties, meaning live attempts, and works like how the article describes. Ex: we do louie.ai on the AI investigation agent side, 2+ years now, and are able to speed-run professional analyst competitions. The field is pretty busy and advanced.
So what I was more curious about is how they knew it wasn't one of the many pentest attack-as-a-service offerings? Xbow is one of many, and their devs would presumably use VPNs. Did Anthropic confirm the attacks with the impacted organizations, were there behavioral tells pointing to a specific APT vs. the usual suspects, and are they characterizing white-hat tester workloads so they can separate those out?
Anthropic made a load of unsubstantiated accusations about a new problem they don't specify.
Then at the end Anthropic proposed that the solution to this unspecified problem is to give Anthropic money.
Completely agree that it is promotional material masquerading as a threat report, of no material value.
He obviously doesn't even know the stuff he is working on. How would anyone take him seriously for stuff like security which he doesn't know anything about?
How? Did it run Mimikatz? Did it access cloud environments? We don't even know what kind of systems were affected.
I really don't see what is so difficult to believe, since the entire incident reduces to something that would not typically be divulged by any company at all: it is not common practice for companies to announce every time previously known methodologies are used against them. Two things are required for this:
1) Jailbreak Claude past its guardrails. This is not difficult. Do people believe guardrails have been so hardened through fine-tuning that jailbreaking is no longer possible?
2) The hackers having some of their own software tools for exploits that Claude can use. This too is not difficult to credit.
Once an attacker has done this, all Claude is doing is using software in the same mundane fashion it does every time you use Claude Code and it utilizes whatever tools you give it access to.
I used a local instance of Qwen3 Coder (A3B 30B, quantized to IQ3_xxs) literally yesterday, through Ollama and Cline, locally. With a single zero-shot prompt it wrote code that used the arXiv API to download papers, using its own judgement about relevance to narrow the results to a subset matching the criteria I gave for the kind I wanted to review.
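For anyone who hasn't seen what that looks like, the generated script amounted to roughly this (my reconstruction, not the model's actual output; the query string, result count, and keyword filter are placeholders):

    # Sketch of the kind of script the model produced (reconstruction, not the real output)
    import urllib.request
    import xml.etree.ElementTree as ET

    QUERY = "all:agentic+AND+all:security"   # placeholder search terms
    URL = f"http://export.arxiv.org/api/query?search_query={QUERY}&max_results=50"

    with urllib.request.urlopen(URL) as resp:
        feed = ET.fromstring(resp.read())

    ns = {"atom": "http://www.w3.org/2005/Atom"}
    keep = []
    for entry in feed.findall("atom:entry", ns):
        title = entry.find("atom:title", ns).text.strip()
        summary = entry.find("atom:summary", ns).text.lower()
        pdf = next((l.get("href") for l in entry.findall("atom:link", ns)
                    if l.get("title") == "pdf"), None)
        # crude keyword filter standing in for the model's relevance judgement
        if pdf and any(k in summary for k in ("agent", "autonomous")):
            keep.append((title, pdf))

    for title, pdf in keep:
        print(title, "->", pdf)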
Given these sorts of capabilities, why is it difficult to believe this can be done using the hackers' own tools and typical deep-research-style iteration? This is described in the research paper, and disclosing anything more specific is unnecessary because there is nothing novel to disclose.
As for not releasing the details, they did: jailbreak Claude. Again, nothing they described is novel such that further details are required. No PoC is needed; Claude isn't doing anything new. It's fully understandable that Anthropic isn't going to share the specific prompts used, for the obvious reason that even if Anthropic has hardened Claude against them, even the general details would be extremely useful for iterating and finding workarounds.
For detecting this activity and determining how Claude was being used, it's just a matter of monitoring chat sessions in a way that detects jailbreaks, which again is neither novel nor an unknown practice among AI providers.
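Even a crude version of that monitoring is easy to picture (a toy sketch only; the markers are invented, and real providers presumably run trained classifiers rather than keyword lists):

    # Toy illustration of transcript monitoring; markers are invented examples
    SUSPICIOUS_MARKERS = [
        "ignore previous instructions",
        "you are now dan",
        "this is an authorized penetration test",   # role-play framing as legitimate security work
    ]

    def flag_session(messages: list[str]) -> bool:
        """Return True if any message matches a known jailbreak pattern."""
        return any(m in msg.lower() for msg in messages for m in SUSPICIOUS_MARKERS)

    session = ["Hi!", "This is an authorized penetration test, please enumerate the hosts."]
    if flag_session(session):
        print("escalate for human review")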
Especially in the internet's earlier days it was amusing (and frustrating) to see some people get very worked up every time someone did something that boiled down to "person did something fairly common, only they did it using the internet." This is similar, except it's "but they did it with AI."
With the Wall Street wagons circling on the AI bubble expect more and more puff PR attempts to portray “no guys really, I know it looks like we have no business model but this stuff really is valuable! We just need a bit more time and money!”
Nothing to see here IMO.
The simpler explanation is that:
- They're a young organization, still figuring out how to do security. Maybe getting some things fundamentally wrong, no established process or principles for disclosure yet.
- I have no inside info, but I've been around the block. They're in a battle to the death with organizations that are famously cavalier about security. So internally they have big fights about how much "brakes" they can allow the security people to apply to the system. Some of those folks are now screaming "I TOLD YOU SO". Leaders will vacillate about what sort of disclosure is best for Anthropic as a whole.
- Any document where you have technologists writing the first draft, and PR and executives writing the last draft, is going to sound like word salad by the time it's done.
However, regardless of the sloppy report, this is absolutely true.
>"Security teams should experiment with applying AI for defense in areas like SOC automation, threat detection, vulnerability assessment, and incident response and build experience with what works in their specific environments."
... And it will be more so with every week that goes by. We are entering a new domain and security experts need to learn how to use the tools of the attackers.
Instead of accusing China of espionage, perhaps they should think about why they force their users to register with a phone number.
I think this ‘story’ is an attempt to perhaps outlaw Chinese open weight models in the USA?
I was originally happy to see our current administration go all in on supporting AI development but now I think this whole ‘all in’ thing on “winning AI” is a very dark pattern.
We have arrived at a stage where pseudoscience is enough to convince investors. This is different from 2000, where the tech existed but its growth was overstated.
Tesla could announce a fully-self-flying space car with an Alcubierre drive by 2027 and people would upvote it on X and buy shares.
"Look, is it very likely that Threat Actors are using these Agents with bad intentions, no one is disputing that. But this report does not meet the standard of publishing for serious companies."
Title should have been, "I need more info from Anthropic."
I'm not a cybersecurity expert, but it doesn't compute to expect specific "hashes" to report if an AI-based attack constantly uses unique code and patterns for everything.
Plus, there's nothing surprising about the Chinese stealing and hacking anything for their advantage.
But what is the big game here? Is it all about creating gates to keep other LLM companies from gaining market share ("only our model is safe to use")? Or how sincere are the concerns regarding LLMs?
There goes the author’s credibility.