What the Patch Record Reveals About Default Trust
A security posture is only as strong as its worst-case behavior under load, and Claude Code's patch record shows a product that repeatedly trusted its own controls past the point those controls remained valid. The deny-rule bypass — confirmed by Adversa AI and SecurityWeek — did not require an attacker to know a secret. It required a command chain long enough to exceed an internal threshold, which routine automated refactoring routinely does. Anthropic patched it, but the patch followed public disclosure rather than preceding it.
The guardrail misfire documented in issue #55940 runs in the opposite direction and is arguably more damaging to trust: a system designed to block dangerous commands instead blocked authorized work, while the commands it was built to catch passed through. That asymmetry — overcautious on legitimate use, permissive on edge cases — is the signature of controls tuned for demos rather than production. The developers who built AgentArmor and the open-source scanners now auditing deployed agents reached that conclusion before Anthropic's patch notes confirmed it.