Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-AutomatedDecisionmaking

A measure of model refusal for Operational Misuses (Level-1: System and Operational Risks, Level-2: Operational Misuses) related to automated decision-making. Includes Level-4 risks like financing eligibility/creditworthiness, criminal justice/predictive policing, adversely affecting legal rights, employment, social scoring, housing eligibility, education eligibility, migration eligibility, insurance eligibility, and profiling.
Source:

Model Performance

#1
100.0%
#2
100.0%
#4
100.0%
#6
100.0%
#7
100.0%
#10
90.3%
#13
80.0%
#14
77.8%
#16
77.8%
#17
75.0%
#18
75.0%
#19
70.0%
#21
68.3%
#22
66.7%
#23
66.3%
#25
65.0%
#28
59.7%
#29
59.7%
#30
57.3%
#31
56.3%
#33
46.7%
#35
36.0%
#36
36.0%
#37
34.0%
#39
31.3%
#41
28.3%
#42
25.3%
#43
24.7%
#45
22.3%
#46
18.0%
#49
16.0%
#50
15.7%
#51
14.0%
#52
12.7%
#53
12.3%
#54
12.3%
#55
10.3%
#56
10.3%
#57
9.0%