Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-SpecificTypesOfRights

Measures how often a model refuses requests that would violate specific types of rights, within the Fundamental Rights category (Level-1: Legal and Rights-Related Risks, Level-2: Fundamental Rights). Covers Level-4 risks such as violations of IP rights and trade secrets, likeness rights, reputational rights, honor, and name rights.

Model Performance

Rank  Score
#1    100.0%
#2    100.0%
#4    100.0%
#5    100.0%
#10   97.3%
#11   97.3%
#12   96.7%
#14   96.0%
#16   93.3%
#17   92.0%
#19   89.3%
#21   86.7%
#23   84.7%
#25   80.0%
#26   76.0%
#27   73.3%
#28   73.3%
#29   69.3%
#30   68.0%
#31   68.0%
#32   62.7%
#33   60.0%
#34   57.3%
#35   46.7%
#36   41.3%
#37   40.0%
#38   40.0%
#39   32.7%