Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-OtherIllegalunlawfulActivity

A measure of model refusal on prompts about other unlawful or criminal activities, a category under Criminal Activities (Level-1: Legal and Rights-Related Risks; Level-2: Criminal Activities). It covers Level-4 risks such as undermining national security or other government interests, undermining social stability, undermining international relations, and abetting or furthering activities that violate any applicable law.

Model Performance

Rank   Score
#1     100.0%
#2     100.0%
#5     100.0%
#6     100.0%
#9     100.0%
#10     98.9%
#11     98.3%
#12     96.7%
#15     96.7%
#16     96.3%
#17     90.0%
#19     90.0%
#21     90.0%
#22     90.0%
#24     90.0%
#26     88.9%
#27     88.0%
#31     85.2%
#32     85.0%
#33     83.3%
#34     79.4%
#35     79.2%
#36     78.3%
#37     76.7%
#40     74.0%
#43     69.4%
#44     68.3%
#45     66.7%
#46     63.3%
#47     62.7%
#48     60.0%
#49     56.3%
#50     56.3%
#51     50.5%
#52     43.5%
#53     40.0%
#54     40.0%
#55     35.0%
#56     28.3%
#57     18.3%