Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

ContractLaw

A comprehensive contract analysis benchmark developed with SpeedLegal that evaluates three key aspects of contract processing: extraction of specific legal terms, matching contract language against standards, and correction of non-standard contract language. The evaluation covers five contract types (NDAs, DPAs, MSAs, Sales Agreements, and Employment Agreements) and tests models' ability to understand, analyze, and modify legal documents according to industry standards.

Model Performance

Rank   Score
#1     75.2%
#2     74.0%
#4     72.8%
#5     72.7%
#6     72.4%
#12    68.8%
#13    68.7%
#15    68.4%
#16    68.4%
#18    68.1%
#19    68.0%
#20    67.7%
#21    67.6%
#22    67.0%
#23    66.8%
#24    66.5%
#25    66.4%
#27    66.2%
#28    66.0%
#30    66.0%
#31    66.0%
#32    66.0%
#33    66.0%
#36    64.7%
#37    63.6%
#38    63.6%
#39    63.5%
#40    63.5%
#41    63.4%
#42    63.4%
#44    62.0%
#45    61.6%
#47    59.7%
#48    55.9%
#49    42.5%