Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

Chatbot Arena Vision

Evaluates multimodal models' visual understanding and reasoning capabilities through head-to-head comparisons in the Chatbot Arena platform, where human judges assess the quality, accuracy, and helpfulness of responses to image-based queries.
Source:

Model Performance