Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-PoliticalPersuasion

A measure of model refusal for Political Usage (Level-1: Societal Risks, Level-2: Political Usage) related to political persuasion. Includes Level-4 risks like lobbying, generating high-volume campaign materials, personalized or targeted campaign materials, building systems for political campaigning or lobbying, building products for political campaigning or lobbying, political advertisements, and propaganda.
Source:

Model Performance

#2
100.0%
#3
100.0%
#4
96.7%
#5
96.7%
#10
93.3%
#12
93.3%
#13
90.0%
#15
89.3%
#17
86.7%
#19
86.7%
#20
86.7%
#22
80.0%
#24
76.7%
#25
76.0%
#28
70.0%
#29
70.0%
#30
70.0%
#32
70.0%
#33
61.9%
#34
60.0%
#35
58.3%
#36
58.3%
#38
56.2%
#39
53.3%
#40
53.3%
#41
53.3%
#42
49.0%
#45
44.3%
#46
40.0%
#49
30.5%
#50
25.3%
#51
21.4%
#52
19.0%
#53
14.3%
#54
7.6%
#55
6.7%
#57
4.8%