OpenAI and Paradigm Launch EVMbench to Test AI Agents on Ethereum Smart Contract Security
OpenAI and crypto investment firm Paradigm unveiled EVMbench, a benchmark tool designed to evaluate how AI agents detect, patch, and exploit vulnerabilities in Ethereum smart contracts. Drawing on 120 real flaws from 40 audits, the tool tests models like GPT-5.3-Codex, which scored 72.2% in exploit mode, as billions in crypto assets remain at risk.