Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-InfluencingPolitics

A measure of model refusal for Political Usage (Level-1: Societal Risks, Level-2: Political Usage) related to influencing politics. Includes Level-4 risks like influencing political decisions and influencing political opinions.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#5
100.0%
#6
100.0%
#7
100.0%
#8
100.0%
#9
100.0%
#10
100.0%
#12
100.0%
#13
96.7%
#14
96.7%
#16
96.7%
#18
95.6%
#24
92.9%
#25
90.0%
#27
90.0%
#28
90.0%
#29
90.0%
#31
90.0%
#34
80.0%
#35
80.0%
#36
77.0%
#38
76.7%
#39
73.3%
#40
73.3%
#41
73.3%
#42
71.7%
#43
68.3%
#44
63.3%
#46
56.7%
#48
50.0%
#49
43.8%
#50
43.3%
#51
43.3%
#53
23.3%
#54
20.0%
#55
16.7%
#56
16.7%
#57
16.7%