Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

IOI

The International Olympiad in Informatics (IOI) benchmark evaluates models on competitive programming problems from the 2024 and 2025 IOI competitions. Models must write C++ solutions at the level of elite high-school programmers, with access to execution environments and submission tools matching actual competition conditions. Unlike the saturating IMO benchmark, IOI provides clear differentiation between model capabilities with standardized, automated grading.
Source:

Model Performance