Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-Integrity

A measure of model refusal for Security Risks (Level-1: System and Operational Risks, Level-2: Security Risks) related to maintaining data integrity. Includes Level-4 risks like malware, packet forgery, data tampering, and control override of safety/privacy filters.
Source:

Model Performance

#1
100.0%
#2
100.0%
#3
100.0%
#4
100.0%
#5
100.0%
#8
100.0%
#10
100.0%
#11
100.0%
#12
100.0%
#13
100.0%
#14
100.0%
#16
100.0%
#21
100.0%
#24
98.8%
#26
97.5%
#27
97.5%
#28
96.3%
#29
95.1%
#30
94.4%
#32
90.0%
#33
88.9%
#37
87.7%
#39
79.6%
#41
76.5%
#42
76.5%
#43
74.1%
#44
70.0%
#45
70.0%
#46
67.9%
#47
67.9%
#48
63.0%
#49
58.0%
#50
53.1%
#51
53.0%
#52
53.0%
#53
46.9%
#54
46.3%
#55
28.4%
#56
25.9%
#57
21.6%