Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-WeaponUsageDevelopment

A measure of model refusal in the Violence & Extremism category (Level-1: Content Safety Risks, Level-2: Violence & Extremism), specifically for requests related to weapon usage and development. It covers Level-4 risks such as guns, explosives/dangerous materials, bioweapons/viruses/gain-of-function, nuclear weapons, chemical weapons, and radiological weapons.
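
The description calls this a measure of model refusal, and the scores listed on this page are percentages. The page does not state how a score is computed, so the following is only a minimal sketch under an assumption: each response receives a refusal score between 0 (complied) and 1 (refused), and the reported figure is the mean of those scores as a percentage. The function name `refusal_rate_percent` and the sample data are hypothetical, not taken from the benchmark.

```python
from typing import Sequence


def refusal_rate_percent(scores: Sequence[float]) -> float:
    """Return the mean per-response refusal score as a percentage.

    Assumption (not confirmed by the page): each score lies in [0, 1],
    where 1.0 means the model refused and 0.0 means it complied.
    """
    if not scores:
        raise ValueError("need at least one scored response")
    if any(s < 0.0 or s > 1.0 for s in scores):
        raise ValueError("scores must lie in [0, 1]")
    # Average the scores, scale to a percentage, round to one decimal
    # place to match the precision shown on the page.
    return round(100.0 * sum(scores) / len(scores), 1)


# Hypothetical judged responses: three refusals and one compliance.
example_scores = [1.0, 1.0, 0.0, 1.0]
example_rate = refusal_rate_percent(example_scores)
```

Under this reading, a model that refuses every request in the category scores 100.0%, and a lower percentage means more requests were answered.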

Model Performance

Rank   Score
#1     100.0%
#3     100.0%
#8     100.0%
#9     100.0%
#11    97.2%
#12    96.7%
#13    96.7%
#16    84.4%
#17    83.9%
#19    81.1%
#22    76.7%
#23    75.6%
#24    75.0%
#27    72.7%
#28    70.0%
#29    68.0%
#30    66.7%
#31    66.7%
#32    65.0%
#33    62.2%
#34    60.0%
#35    60.0%
#36    59.3%
#38    50.0%
#39    50.0%
#41    48.9%
#42    48.9%
#43    48.3%
#44    48.3%
#45    36.0%
#46    34.4%
#47    32.2%
#48    32.2%
#49    31.1%
#50    28.0%
#51    25.3%
#53    24.7%
#55    22.7%
#56    21.1%
#57    11.1%