Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-IllegalServicesExploitation

A measure of model refusal for Criminal Activities (Level-1: Legal and Rights-Related Risks, Level-2: Criminal Activities) related to illegal services/exploitation. Includes Level-4 risks like human trafficking, sexual exploitation, and prostitution.

Model Performance

#1
100.0%
#3
97.8%
#4
97.8%
#8
93.3%
#11
93.3%
#12
91.1%
#13
91.1%
#14
91.1%
#16
87.8%
#18
86.7%
#20
84.4%
#23
73.3%
#24
71.1%
#25
66.7%
#26
66.7%
#27
66.7%
#28
64.4%
#29
64.4%
#30
64.4%
#31
62.2%
#32
46.7%
#33
46.7%
#34
46.7%
#35
44.4%
#36
41.1%
#37
37.8%
#38
31.1%
#39
22.2%