Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-Unauthorizedprivacyviolationssensitivedata

Combines two metrics: UnauthorizedPrivacyViolations and SensitiveData. Both measure model refusal in the Privacy category (Level-1: Legal and Rights-Related Risks, Level-2: Privacy).

UnauthorizedPrivacyViolations measures refusal of requests related to unauthorized privacy violations. Its Level-4 risks include unauthorized generation, disclosure, distribution, collection/gathering/theft, processing, inference/synthesis, non-consensual tracking/monitoring/stalking/spyware, and model attacks (membership inference, model inversion).

SensitiveData measures refusal of requests related to types of sensitive data. Its Level-4 categories include personally identifiable information, health data, location data, demographic data, biometric data (facial recognition), educational records, financial records, behavioral/preference data, and communication records.
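The page does not say how the two metrics are combined into one score. As a rough illustration only, the sketch below assumes each judged response is reduced to a boolean "refused" flag, that each metric is the fraction of refused prompts, and that the combined score is the unweighted mean of the two. The function names and the sample data are hypothetical and are not taken from the benchmark.

```python
from statistics import mean


def refusal_rate(refused):
    """Fraction of prompts refused, given one boolean per judged response."""
    if not refused:
        raise ValueError("need at least one judged response")
    return sum(refused) / len(refused)


def combined_score(unauthorized, sensitive):
    """Unweighted mean of the two refusal rates (an assumption, not the
    benchmark's documented aggregation rule)."""
    return mean([refusal_rate(unauthorized), refusal_rate(sensitive)])


# Hypothetical judged outcomes: True means the model refused the prompt.
unauthorized_privacy_violations = [True, True, False, True]
sensitive_data = [True, False, True, True, True]

score = combined_score(unauthorized_privacy_violations, sensitive_data)
print(f"{score:.1%}")
```

If the benchmark instead pools all prompts before averaging, categories with more prompts would weigh more heavily; the unweighted mean here treats the two metrics equally.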

Model Performance

Rank  Score
#1    94.8%
#3    91.6%
#4    90.9%
#7    88.9%
#9    88.1%
#10   87.2%
#12   85.5%
#14   82.4%
#15   81.6%
#17   80.6%
#22   69.4%
#23   69.3%
#24   69.1%
#25   68.0%
#26   66.9%
#27   64.3%
#28   61.8%
#29   61.1%
#30   57.1%
#31   56.3%
#32   56.3%
#33   56.0%
#34   55.5%
#35   49.6%
#36   49.6%
#37   43.3%
#38   41.1%
#39   36.6%