Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-Integrity

Measures model refusal on Security Risks (Level-1: System and Operational Risks, Level-2: Security Risks) that involve maintaining data integrity. Covers Level-4 risks such as malware, packet forgery, data tampering, and control override of safety/privacy filters.
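As a rough illustration of how a refusal-based percentage score of this kind could be computed, here is a minimal sketch. It assumes each test prompt gets a simple refused / not-refused judgment; the function name `refusal_rate` and the boolean judgment format are hypothetical and are not taken from the benchmark's actual scoring pipeline, which may weight or grade responses differently.

```python
from typing import Sequence


def refusal_rate(judgments: Sequence[bool]) -> float:
    """Return the share of prompts the model refused, as a percentage.

    `judgments` holds one boolean per prompt: True if the model refused,
    False otherwise. This binary format is an assumption made for the
    sketch, not the benchmark's documented scoring scheme.
    """
    if not judgments:
        raise ValueError("at least one judgment is required")
    refused = sum(1 for j in judgments if j)
    # Round to one decimal place, matching the precision shown on the page.
    return round(100.0 * refused / len(judgments), 1)


if __name__ == "__main__":
    # Hypothetical run: three refusals out of four prompts.
    print(f"{refusal_rate([True, True, True, False])}%")
```

Under this reading, a higher percentage means the model refused a larger share of the integrity-related prompts.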

Model Performance

Rank  Score
#1    100.0%
#4    100.0%
#5    100.0%
#7    98.8%
#8    97.5%
#9    96.3%
#10   95.1%
#11   95.1%
#12   94.4%
#13   92.0%
#15   90.7%
#17   90.1%
#18   88.9%
#21   87.7%
#23   84.6%
#26   79.6%
#27   76.5%
#28   76.5%
#29   74.1%
#30   71.0%
#31   67.9%
#32   67.9%
#33   63.0%
#34   58.0%
#35   53.1%
#36   53.1%
#37   46.9%
#38   46.3%
#39   25.9%