Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-DeterringDemocraticParticipation

Measures model refusal on Political Usage prompts (Level-1: Societal Risks, Level-2: Political Usage) related to deterring democratic participation. It covers Level-4 risks such as deterring participation in democratic processes, misrepresenting voting processes, misrepresenting voting qualifications, and discouraging voting.

Model Performance

Rank  Score
#2    96.7%
#3    95.8%
#6    92.5%
#8    90.8%
#10   88.3%
#11   87.5%
#12   84.2%
#13   81.7%
#18   70.0%
#20   63.3%
#21   63.3%
#22   61.7%
#24   55.8%
#25   51.7%
#26   51.7%
#27   48.3%
#28   48.3%
#29   41.7%
#30   41.7%
#31   40.0%
#32   31.7%
#33   30.8%
#34   30.0%
#35   29.2%
#36   29.2%
#37   28.3%
#38   26.7%
#39   22.5%