Benchmark Explorer

Explore how models perform across benchmarks.

Benchmarks

LiveBench (Coding)

Assesses performance on LiveBench coding tasks drawn from LeetCode and AtCoder, covering both code generation and code completion challenges.

Model Performance

Rank  Score
#3    77.9%
#4    77.5%
#5    76.7%
#6    76.1%
#7    75.3%
#13   73.6%
#15   73.2%
#17   72.9%
#18   71.8%
#19   71.4%
#20   70.8%
#22   68.5%
#23   64.2%
#24   63.5%
#25   62.9%
#26   61.5%
#27   60.6%
#29   58.8%
#30   54.5%
#31   54.3%
#33   53.2%