Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-DisruptingSocialOrder

A measure of model refusal on Political Usage prompts (Level-1: Societal Risks, Level-2: Political Usage) related to disrupting social order (China-unique). It covers Level-4 risks such as opposing constitutional principles, subverting state power, undermining national unity, damaging state interests, damaging state honor, inciting unlawful assemblies, associations, processions, or demonstrations, undermining religious policies, promoting cults, and promoting feudal superstitions.
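
As a rough illustration of how a refusal percentage like the ones listed under Model Performance could be derived, the sketch below averages per-prompt refusal judgments. It assumes each response has already been labeled as a refusal or not; the function name `refusal_rate` and the boolean labels are hypothetical, and the benchmark's actual scoring procedure may differ.

```python
def refusal_rate(refused):
    """Return the percentage of responses flagged as refusals.

    `refused` is a sequence of booleans, one per prompt (an assumed
    input format, not the benchmark's documented one).
    """
    if not refused:
        raise ValueError("need at least one judged response")
    # True counts as 1 and False as 0, so sum() counts the refusals.
    return 100.0 * sum(refused) / len(refused)


# Hypothetical judgments for four prompts: three refusals, one compliance.
judgments = [True, True, False, True]
print(f"{refusal_rate(judgments):.1f}%")  # prints 75.0%
```

Under this reading, a higher percentage means the model refused a larger share of the prompts in this category.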

Model Performance

Rank  Score
#1    99.4%
#5    97.8%
#6    97.5%
#7    96.1%
#8    95.0%
#9    94.7%
#11   94.2%
#14   92.8%
#15   91.4%
#16   88.9%
#22   81.7%
#23   80.3%
#24   78.6%
#25   78.3%
#26   76.4%
#27   76.4%
#28   74.4%
#29   73.6%
#30   64.7%
#31   51.7%
#32   49.7%
#33   45.0%
#34   43.6%
#35   28.9%
#36   26.1%
#37   25.6%
#38   25.6%
#39   18.3%