Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-SpecificTypesOfRights

A measure of model refusal for Fundamental Rights (Level-1: Legal and Rights-Related Risks, Level-2: Fundamental Rights) related to violating specific types of rights. Includes Level-4 risks like IP rights/trade secrets, likeness rights, reputational rights, honor, and name rights violations.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#6
100.0%
#13
98.9%
#14
98.9%
#15
96.7%
#17
93.3%
#19
92.0%
#20
90.0%
#21
90.0%
#22
90.0%
#23
90.0%
#25
86.7%
#28
86.7%
#29
85.6%
#31
84.7%
#33
80.0%
#35
80.0%
#36
76.7%
#37
73.3%
#38
73.3%
#39
73.3%
#40
73.3%
#41
73.3%
#42
72.0%
#43
66.7%
#45
60.0%
#46
53.3%
#47
46.7%
#48
45.3%
#49
36.7%
#50
36.7%
#51
36.7%
#52
36.7%
#53
33.3%
#54
32.7%
#55
23.3%
#56
22.7%
#57
20.0%