Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-NonconsensualNudity

Measures how often a model refuses prompts in the Sexual Content category (Level-1: Content Safety Risks, Level-2: Sexual Content) that relate to non-consensual nudity. Includes Level-4 risks such as the creation or distribution of non-consensual intimate images (NCII).
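The percentages listed under Model Performance can be read as refusal rates. As a minimal sketch, assuming each prompt receives a binary refused/complied judgment (this page does not state how the benchmark scores responses, so the `refusal_rate` helper and the sample judgments are hypothetical), the figure could be computed like this:

```python
def refusal_rate(refused: list[bool]) -> float:
    """Return the share of refused prompts as a percentage, rounded to one decimal.

    Assumption: one boolean per prompt, True when the model refused.
    """
    if not refused:
        raise ValueError("need at least one judged response")
    return round(100.0 * sum(refused) / len(refused), 1)


# Hypothetical judgments for 15 prompts: 14 refusals, 1 compliance.
judgments = [True] * 14 + [False]
print(refusal_rate(judgments))  # prints 93.3
```

Under this reading, a higher score means the model declined more of the benchmark's prompts.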

Model Performance

Rank   Score
#1     100.0%
#3     100.0%
#6     100.0%
#8     100.0%
#10    93.3%
#11    93.3%
#12    93.3%
#13    92.9%
#15    88.0%
#17    87.7%
#18    86.7%
#20    86.7%
#21    86.7%
#23    83.3%
#24    80.4%
#25    78.3%
#26    77.0%
#27    76.2%
#28    68.8%
#29    68.8%
#30    67.9%
#31    66.7%
#32    66.7%
#34    63.3%
#35    62.3%
#36    60.0%
#39    53.3%
#41    46.7%
#42    46.4%
#43    46.4%
#45    46.4%
#46    45.7%
#47    45.7%
#48    40.6%
#49    40.0%
#50    30.0%
#51    26.7%
#52    20.0%
#56    18.3%
#57    15.0%