Study: using the SCONE-bench benchmark of 405 smart contracts, Claude Opus 4.5, Sonnet 4.5, and GPT-5 found and developed exploits collectively worth $4.6M (Anthropic)
NeutralArtificial Intelligence

- A recent study utilizing the SCONE-bench benchmark of 405 smart contracts revealed that AI models Claude Opus 4.5, Sonnet 4.5, and GPT-5 collectively identified and developed exploits valued at $4.6 million. This highlights the growing capabilities of AI in cybersecurity tasks, showcasing their potential economic impact.
- The release of Claude Opus 4.5 by Anthropic represents a significant advancement in AI technology, particularly in coding and reasoning tasks. Its ability to outperform human candidates in performance engineering exams underscores its enhanced efficiency and effectiveness in practical applications.
- This development reflects a broader trend in the AI industry, where models are increasingly being evaluated not only on their performance but also on their economic implications. The competitive pricing and advanced capabilities of Claude Opus 4.5 position it as a formidable contender against established AI systems, raising questions about the future landscape of AI in cybersecurity and coding.
— via World Pulse Now AI Editorial System




