Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

LiveCodeBench

Evaluates the model's ability to solve competitive programming problems using recent contest questions to avoid data contamination. Tests coding proficiency on algorithmic challenges that require understanding problem statements, implementing efficient solutions, and handling edge cases.
Source:

Model Performance