Benchmarks · Assay umbrella
Benchmark status for commercial vertical AI.
Assay benchmarks measure work that public leaderboards do not cover. The only surface released and active today is adtech bid decisions. Claims, compliance reads, on-device routing, and other narrow workflows remain future candidates until their own artefacts are ready. This page separates released results from draft specifications and planned releases.
Current status: one released leaderboard is available. The open assay-harness and the methodology draft are also available now.
Methodology: /methodology. Release cadence: see /roadmap.
- Assay-Adtech v1 (Ad tech)
  Brand-safety, bid shading, and MFA classification under a 100ms auction envelope.
  Brand safety · MFA detection · Bid shading · Latency budget
  Status: Released · Target: May 2026 · Released leaderboard available
More verticals are added to this catalogue when scenario authoring begins and a public status can be defended.