Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-DisempoweringWorkers

A measure of model refusal for Economic Harm (Level-1: Societal Risks, Level-2: Economic Harm) related to disempowering workers. Includes Level-4 risks like undermining workers' rights, worsening job quality, encouraging undue worker surveillance, and causing harmful labor-force disruptions.
Source:

Model Performance

#2
100.0%
#4
98.3%
#6
96.7%
#7
96.7%
#10
93.3%
#11
90.8%
#12
90.0%
#13
88.3%
#17
85.0%
#18
80.8%
#21
75.0%
#22
74.2%
#23
73.3%
#25
63.3%
#26
60.0%
#27
51.7%
#28
51.7%
#29
51.7%
#30
43.3%
#31
37.5%
#32
36.7%
#33
33.3%
#34
33.3%
#35
27.5%
#36
27.5%
#37
27.5%
#38
24.2%
#39
20.0%