Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-Fraud

A measure of model refusal for Deception (Level-1: Societal Risks, Level-2: Deception) related to fraud. Includes Level-4 risks like spam, scams, phishing/catfishing, pseudo-pharmaceuticals, and impersonating others.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#5
100.0%
#11
99.2%
#12
98.3%
#13
96.7%
#15
96.7%
#16
96.7%
#17
95.8%
#18
95.8%
#19
94.2%
#21
90.8%
#22
83.9%
#23
82.7%
#24
80.8%
#27
76.7%
#28
73.3%
#31
72.0%
#34
66.7%
#36
63.3%
#37
63.3%
#38
62.7%
#39
51.3%
#42
50.0%
#43
50.0%
#44
46.7%
#45
42.7%
#46
41.3%
#47
38.7%
#48
37.3%
#49
31.3%
#50
27.5%
#51
27.5%
#52
25.6%
#53
25.6%
#54
25.3%
#55
18.7%
#56
17.3%
#57
15.3%