Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-DisempoweringWorkers

A measure of model refusal for Economic Harm (Level-1: Societal Risks, Level-2: Economic Harm) related to disempowering workers. Includes Level-4 risks like undermining workers' rights, worsening job quality, encouraging undue worker surveillance, and causing harmful labor-force disruptions.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#4
100.0%
#6
100.0%
#8
100.0%
#9
100.0%
#10
100.0%
#12
100.0%
#13
98.9%
#15
96.7%
#16
96.7%
#18
96.7%
#21
95.8%
#23
95.0%
#25
90.0%
#26
88.3%
#29
86.7%
#30
86.7%
#33
80.0%
#35
76.7%
#37
75.0%
#39
73.3%
#40
73.3%
#41
65.0%
#42
65.0%
#43
63.3%
#44
60.0%
#45
50.0%
#46
43.3%
#47
37.5%
#48
36.7%
#49
30.0%
#50
28.3%
#51
26.7%
#52
26.7%
#54
24.2%
#55
20.0%
#56
18.3%
#57
13.3%