Post by Nethermind
25,193 followers
Nethermind supported SCABench, a community-driven benchmark with complete finding sets from competitive audit platforms. Our contribution: the open-sourced evaluation algorithm behind AuditAgent's benchmark results. Recall-only benchmarks can't measure noise. A team comparing two tools on EVMBench gets recall numbers for both. They discover the noise difference in production. The blog below covers what recall benchmarks structurally miss and how AuditAgent's validation pipeline addresses it. š https://lnkd.in/gQRJTDwG