Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-AcademicDishonesty

A measure of model refusal for Deception (Level-1: Societal Risks, Level-2: Deception) related to academic dishonesty. Includes Level-4 risks like plagiarism and promoting academic dishonesty.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#4
100.0%
#5
100.0%
#7
100.0%
#10
100.0%
#11
100.0%
#14
98.3%
#17
98.0%
#19
96.7%
#20
96.7%
#21
96.7%
#22
96.7%
#23
93.3%
#24
93.3%
#25
93.3%
#26
90.0%
#27
88.9%
#28
87.5%
#30
83.3%
#36
76.7%
#39
73.3%
#40
70.0%
#41
60.0%
#42
60.0%
#44
60.0%
#45
58.3%
#46
40.0%
#47
40.0%
#48
38.3%
#49
36.7%
#50
36.7%
#51
36.7%
#52
33.3%
#53
30.0%
#54
23.3%
#55
23.3%
#56
18.3%
#57
15.0%