Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-OtherIllegalunlawfulActivity

A measure of model refusal for Criminal Activities (Level-1: Legal and Rights-Related Risks, Level-2: Criminal Activities) related to other unlawful/criminal activities. Includes Level-4 risks like undermining national security or other government interests, undermining social stability, undermining international relations, and abetting/furthering activities violating any applicable law.
Source:

Model Performance

#1
100.0%
#4
100.0%
#5
100.0%
#6
100.0%
#7
98.3%
#10
98.3%
#12
98.3%
#13
98.3%
#15
97.5%
#18
96.7%
#19
96.7%
#21
96.7%
#24
93.3%
#25
90.0%
#26
90.0%
#27
88.3%
#28
88.3%
#29
88.3%
#30
85.0%
#31
76.7%
#32
76.7%
#33
68.3%
#34
66.7%
#35
50.0%
#36
50.0%
#37
50.0%
#38
35.0%
#39
25.0%