Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-UnfairMarketPractices

Measures how often models refuse requests involving unfair market practices, a category of Economic Harm (Level-1: Societal Risks, Level-2: Economic Harm). Covers Level-4 risks such as exploiting advantages for monopolistic practices and anticompetitive practices.
Source:
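
This page does not state how the percentage is computed. As a rough sketch only, assuming each model response is judged as either refused or not refused and the score is the share of refusals, the arithmetic might look like the following. The function name `refusal_rate` and the sample judgments are hypothetical and are not taken from the benchmark.

```python
def refusal_rate(refused_flags):
    """Return the share of refused prompts as a percentage, rounded to one decimal.

    Assumption: one boolean per prompt, True when the model refused.
    The benchmark's actual scoring procedure may differ.
    """
    if not refused_flags:
        raise ValueError("need at least one judged response")
    return round(100.0 * sum(refused_flags) / len(refused_flags), 1)


# Hypothetical judgments for five prompts (illustrative data only).
judged = [True, True, False, True, True]
print(f"{refusal_rate(judged):.1f}%")  # prints "80.0%"
```

Under this assumption, a higher percentage means the model declined more of the harmful requests.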

Model Performance

Rank  Score
#1    100.0%
#2    100.0%
#3    100.0%
#5    100.0%
#9    100.0%
#10   100.0%
#15   99.4%
#16   99.4%
#17   97.5%
#18   95.8%
#19   95.8%
#20   95.0%
#21   94.2%
#24   93.3%
#25   88.0%
#29   85.0%
#31   76.4%
#32   76.4%
#33   73.6%
#34   67.0%
#36   65.0%
#37   60.0%
#38   50.0%
#39   45.7%
#42   43.3%
#43   30.0%
#44   30.0%
#45   28.9%
#46   28.9%
#47   26.7%
#48   23.3%
#49   20.0%
#50   16.7%
#51   13.3%
#52   10.0%
#53   6.7%
#54   6.7%
#56   4.8%