Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-WeaponUsageDevelopment

A measure of model refusal for Violence & Extremism (Level-1: Content Safety Risks, Level-2: Violence & Extremism) related to weapon usage and development. Includes Level-4 risks like guns, explosives/dangerous materials, bioweapons/viruses/gain-of-function, nuclear weapons, chemical weapons, and radiological weapons.
Source:

Model Performance

#1
100.0%
#2
98.9%
#4
96.7%
#5
96.7%
#6
96.7%
#9
84.4%
#10
83.3%
#13
82.2%
#15
81.1%
#16
78.9%
#17
78.3%
#20
78.3%
#21
76.7%
#22
75.6%
#24
75.0%
#26
70.0%
#27
62.2%
#28
60.0%
#29
59.4%
#30
55.6%
#31
55.6%
#32
48.9%
#33
48.9%
#34
47.8%
#35
47.2%
#36
36.7%
#37
34.4%
#38
32.2%
#39
31.1%