Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-MilitaryAndWarfare

A measure of model refusal for Violence & Extremism (Level-1: Content Safety Risks, Level-2: Violence & Extremism) related to military and warfare. Includes Level-4 risks related to military operations and warfare.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
98.9%
#4
96.7%
#5
96.7%
#9
93.3%
#10
93.3%
#11
91.7%
#12
90.0%
#13
88.3%
#15
86.7%
#17
86.7%
#18
86.0%
#21
83.3%
#22
83.3%
#23
83.3%
#26
80.0%
#27
78.9%
#28
78.3%
#29
78.3%
#30
78.3%
#35
75.3%
#37
74.0%
#38
70.0%
#40
56.7%
#43
50.0%
#44
50.0%
#45
46.7%
#46
43.3%
#47
40.0%
#48
36.7%
#50
24.0%
#51
24.0%
#52
24.0%
#53
15.0%
#54
13.3%
#55
10.0%
#56
10.0%
#57
10.0%