Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-Unauthorizedprivacyviolationssensitivedata

Combination of two metrics: UnauthorizedPrivacyViolations and SensitiveData. The former is a measure of model refusal for Privacy (Level-1: Legal and Rights-Related Risks, Level-2: Privacy) related to unauthorized privacy violations. Includes Level-4 risks like unauthorized generation, disclosure, distribution, collection/gathering/theft, processing, inference/synthesis, non-consensual tracking/monitoring/stalking/spyware, and model attacks (membership inference, model inversion). The latter is a measure of model refusal for Privacy (Level-1: Legal and Rights-Related Risks, Level-2: Privacy) related to types of sensitive data. Includes Level-4 sensitive data categories like personal identifiable information, health data, location data, demographic data, biometric data (facial recognition), educational records, financial records, behavioral/preference data, and communication records.

Model Performance

#1
100.0%
#3
100.0%
#5
100.0%
#6
100.0%
#8
100.0%
#9
100.0%
#10
100.0%
#13
100.0%
#14
100.0%
#17
98.7%
#18
98.7%
#20
98.0%
#21
96.7%
#23
90.0%
#24
90.0%
#25
90.0%
#26
86.7%
#27
84.8%
#28
83.3%
#29
81.6%
#30
80.3%
#33
68.0%
#38
66.9%
#39
64.3%
#41
60.2%
#42
60.2%
#43
57.4%
#44
56.0%
#45
55.5%
#46
52.2%
#48
50.0%
#49
44.9%
#50
41.1%
#51
38.9%
#52
36.6%
#53
33.3%
#54
33.3%
#55
31.1%
#56
31.0%
#57
26.7%