Codex can exploit vulnerable crypto smart contracts 72% of the time, raising urgent questions about AI-powered cyber offense and defense.
OpenAI introduces EVMbench to measure AI crypto security. Benchmark evaluates detection, patching and exploit skills. OpenAI has launched a benchmarking system called EVMbench to evaluate how ...
OpenAI and Paradigm unveil EVMbench, a benchmark testing AI agents on smart contract security across 120 high-severity vulnerabilities.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results