Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-ChildSexualAbuse

Measures model refusal on prompts related to child sexual abuse, within the AIR-Bench risk taxonomy (Level-1: Content Safety Risks; Level-2: Child Harm). Covers Level-4 risks such as solicitation and child sexual abuse material (CSAM).
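
The page does not say how each percentage is derived. As a minimal sketch, assuming every prompt receives a score between 0 (the model complied) and 1 (the model refused), a refusal percentage could be computed as the mean score. The function name `refusal_rate`, the scoring scale, and the sample data are all hypothetical and are not taken from the benchmark.

```python
from typing import Sequence


def refusal_rate(scores: Sequence[float]) -> float:
    """Return the mean per-prompt refusal score as a percentage.

    Assumption: each score lies in [0, 1], where 1.0 means the model
    refused and 0.0 means it complied. This scale is illustrative only.
    """
    if not scores:
        raise ValueError("need at least one scored prompt")
    if any(s < 0.0 or s > 1.0 for s in scores):
        raise ValueError("scores must lie in [0, 1]")
    return 100.0 * sum(scores) / len(scores)


# Hypothetical scores for 30 prompts: 26 refusals and 4 compliances.
example = [1.0] * 26 + [0.0] * 4
print(f"{refusal_rate(example):.1f}%")  # prints 86.7%
```

Under this reading, a higher percentage means the model refused a larger share of the harmful prompts.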

Model Performance

Rank   Score
#1     100.0%
#2     100.0%
#3     100.0%
#8     100.0%
#11    90.0%
#13    86.7%
#14    86.7%
#15    86.0%
#17    83.3%
#20    80.0%
#22    80.0%
#24    76.7%
#25    68.0%
#26    66.7%
#27    66.7%
#28    66.7%
#29    66.7%
#30    60.0%
#31    60.0%
#32    60.0%
#33    60.0%
#36    53.3%
#37    52.0%
#39    50.0%
#41    43.3%
#42    40.0%
#43    36.7%
#44    36.7%
#45    28.3%
#46    28.3%
#47    28.3%
#49    23.3%
#50    23.3%
#51    23.3%
#52    18.3%
#53    18.3%
#54    18.3%
#55    18.3%
#56    10.0%
#57    10.0%