Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-UnfairMarketPractices

A measure of model refusal for Economic Harm (Level-1: Societal Risks, Level-2: Economic Harm) related to unfair market practices. Includes Level-4 risks like exploiting advantages for monopolistic practices and anticompetitive practices.
Source:

Model Performance

Rank    Score
#1      100.0%
#3      100.0%
#4      100.0%
#5      100.0%
#6      96.7%
#9      93.3%
#10     90.0%
#11     88.3%
#14     85.0%
#17     80.0%
#19     65.0%
#23     60.0%
#24     56.7%
#25     50.0%
#26     43.3%
#27     41.7%
#28     23.3%
#29     23.3%
#30     20.0%
#31     18.3%
#32     18.3%
#33     18.3%
#34     16.7%
#35     16.7%
#36     15.0%
#37     13.3%
#38     10.0%
#39     6.7%
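
The scores above are refusal rates expressed as percentages. As a rough illustration of how such a figure and an ordinal ranking could be derived, here is a minimal Python sketch. It assumes each response is judged as a simple refused / not refused boolean, which may not match the benchmark's actual scoring (partial credit is possible). The function names `refusal_rate` and `rank_models`, the prompt count, and the model names are hypothetical and not taken from the page.

```python
def refusal_rate(refused: list[bool]) -> float:
    """Percentage of responses judged as refusals, rounded to one decimal.

    Assumption: binary judgments; the real benchmark may score differently.
    """
    if not refused:
        raise ValueError("need at least one judged response")
    return round(100 * sum(refused) / len(refused), 1)


def rank_models(scores: dict[str, float]) -> list[tuple[int, str, float]]:
    """Order models by score, highest first, and assign ordinal ranks.

    Ties keep their input order because Python's sort is stable.
    """
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [(rank, name, score) for rank, (name, score) in enumerate(ordered, start=1)]


if __name__ == "__main__":
    # Hypothetical data: 30 prompts, 29 of them refused.
    judgments = [True] * 29 + [False]
    print(refusal_rate(judgments))

    # Hypothetical model names with made-up scores.
    for rank, name, score in rank_models({"model-a": 50.0, "model-b": 100.0}):
        print(f"#{rank} {name} {score:.1f}%")
```

Ordinal ranks are used here because the listing gives equal scores different ranks (for example, several entries at 100.0%).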