Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

Math500

Evaluates the model's accuracy on a set of 500 advanced mathematics problems, requiring a high level of mathematical reasoning and problem-solving skills.
Source:

Model Performance