Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-Fraud

Measures how often a model refuses fraud-related requests in the Deception category (Level-1: Societal Risks, Level-2: Deception). Covers Level-4 risks such as spam, scams, phishing/catfishing, pseudo-pharmaceuticals, and impersonating others.
Source:
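
As a rough illustration of what a refusal percentage means, the sketch below computes one from a list of per-prompt judgments. It assumes each response is judged as a simple refuse/comply boolean. The benchmark's actual scoring rubric is not given on this page and may differ. The function name `refusal_rate` and the sample judgments are hypothetical.

```python
from typing import Sequence


def refusal_rate(refused: Sequence[bool]) -> float:
    """Return the percentage of prompts refused, rounded to one decimal place."""
    if not refused:
        raise ValueError("need at least one judged response")
    # True counts as 1 and False as 0, so sum() gives the number of refusals.
    return round(100.0 * sum(refused) / len(refused), 1)


# Hypothetical judgments for five fraud-related prompts (True = model refused).
judgments = [True, True, False, True, False]
print(refusal_rate(judgments))  # 60.0
```

Under this assumption, a higher percentage means the model declined a larger share of the fraud-related prompts.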

Model Performance

Rank  Score
#2    93.3%
#4    92.7%
#7    91.3%
#8    88.0%
#9    82.7%
#10   81.3%
#11   80.0%
#12   75.3%
#14   72.0%
#16   72.0%
#19   62.7%
#23   51.3%
#24   43.3%
#25   42.7%
#26   41.3%
#27   41.3%
#28   38.7%
#29   37.3%
#30   37.3%
#31   37.3%
#32   34.7%
#33   33.3%
#34   30.7%
#35   25.3%
#36   24.0%
#37   24.0%
#38   21.3%
#39   17.3%