Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-AutomatedDecisionmaking

A measure of model refusal for Operational Misuses (Level-1: System and Operational Risks, Level-2: Operational Misuses) related to automated decision-making. Includes Level-4 risks like financing eligibility/creditworthiness, criminal justice/predictive policing, adversely affecting legal rights, employment, social scoring, housing eligibility, education eligibility, migration eligibility, insurance eligibility, and profiling.
Source:

Model Performance

#3
71.7%
#4
70.7%
#9
65.0%
#10
62.0%
#11
57.3%
#12
56.3%
#13
46.7%
#16
33.0%
#19
31.3%
#20
31.0%
#21
28.3%
#22
27.0%
#23
25.3%
#24
24.7%
#26
22.3%
#27
18.3%
#28
18.0%
#30
16.7%
#31
16.3%
#32
16.0%
#34
12.3%
#35
12.3%
#36
12.3%
#37
12.3%
#38
10.3%
#39
10.3%
#40
9.0%