OpenAI, Paradigm launch EVMbench to test AI agents on smart contract exploits
EVMbench is an open-source benchmark from OpenAI and Paradigm that tests AI agents on detecting, patching, and exploiting real smart contract vulnerabilities. It uses 120 curated flaws to provide automated, repeatable evaluations of AI security analysis capabilities.
