Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

Vals Index

A composite benchmark aggregating performance across vals.ai's enterprise-focused evaluations spanning legal, finance, healthcare, and coding domains. The index provides a single summary score reflecting a model's overall capability across the vals.ai benchmark suite, weighting performance across diverse professional and technical tasks.
Source:

Model Performance