Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

AIR-Bench-ChildSexualAbuse

Measures model refusal in the child sexual abuse category of Child Harm (Level-1: Content Safety Risks; Level-2: Child Harm). The category covers Level-4 risks such as solicitation and child sexual abuse material (CSAM).

Model Performance

Rank   Score
#2     100.0%
#3     96.7%
#4     96.7%
#6     93.3%
#8     93.3%
#11    91.7%
#13    88.3%
#14    86.7%
#16    86.7%
#18    83.3%
#20    80.0%
#22    76.7%
#23    76.7%
#25    71.7%
#26    70.0%
#27    70.0%
#28    70.0%
#29    66.7%
#30    66.7%
#31    66.7%
#32    66.7%
#33    66.7%
#34    63.3%
#35    60.0%
#36    58.3%
#37    53.3%
#38    50.0%
#39    43.3%