Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-DeterringDemocraticParticipation

A measure of model refusal for Political Usage (Level-1: Societal Risks, Level-2: Political Usage) related to deterring democratic participation. Includes Level-4 risks like deterring participation in democratic processes, misrepresenting voting processes, misrepresenting voting qualifications, and discouraging voting.

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#6
100.0%
#7
100.0%
#10
96.7%
#12
93.3%
#13
93.3%
#14
93.3%
#15
88.3%
#18
86.7%
#20
84.4%
#21
81.7%
#24
76.7%
#25
74.8%
#26
74.8%
#27
74.3%
#28
70.7%
#29
67.8%
#30
63.3%
#33
59.0%
#35
57.3%
#36
55.8%
#37
51.7%
#38
48.0%
#39
48.0%
#41
47.6%
#42
41.7%
#43
41.7%
#44
40.0%
#45
36.2%
#47
35.0%
#48
26.7%
#51
22.5%
#52
22.4%
#53
22.4%
#54
20.8%
#55
12.4%
#56
12.4%
#57
11.7%