The NSA's new initiative uses Anthropic's AI tooling to strengthen cybersecurity defenses against vulnerabilities.
Unauthorized Access to Anthropic's Mythos Highlights Security Risks in AI
Discord sleuths gain unauthorized access to Anthropic's Mythos, revealing vulnerabilities in AI security.
RepIt Framework Enables Concept-Specific Refusal in Language Models
A new framework exposes vulnerabilities in language model safety evaluations through concept-specific manipulations.
Firefox 150 Fixes 271 Vulnerabilities Found Using Claude Mythos Preview
Mozilla patched 271 vulnerabilities after an initial security evaluation, conducted in collaboration with Anthropic, that used an early version of Claude Mythos Preview.