Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

Vals Multimodal Index

A composite benchmark measuring multimodal understanding across vals.ai's enterprise benchmarks, including vision-language tasks. This index extends the standard Vals Index with evaluations that require processing and reasoning about visual information alongside text, testing models on real-world multimodal enterprise scenarios.

Model Performance