Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

SAGE

An educational assessment benchmark evaluating LLM performance on academic and pedagogical tasks. SAGE measures models' ability to handle educational content across multiple academic levels and subject areas, testing both knowledge recall and the ability to explain concepts effectively in educational contexts.
Source:

Model Performance