Benchmark Explorer

Explore how models perform on various benchmarks

Benchmarks

MortgageTax

A multimodal benchmark that evaluates language models' ability to extract and process information from images of mortgage tax certificates. It tests both semantic extraction (year, parcel number, county) and numerical extraction (calculating the annualized amount due).
Source:
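
The two kinds of extraction described for MortgageTax can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the record layout `TaxCertificateFields`, the helpers `annualize` and `score_prediction`, the definition of "annualized" as installment amount times installments per year, and the 0.01 tolerance. The benchmark's actual schema and scoring rules are not given on this page.

```python
from dataclasses import dataclass


@dataclass
class TaxCertificateFields:
    """Hypothetical record layout; not the benchmark's published schema."""
    year: int
    parcel_number: str
    county: str
    annualized_amount_due: float


def annualize(installment_amount: float, installments_per_year: int) -> float:
    """Scale a per-installment amount to a yearly total (assumed definition)."""
    if installments_per_year <= 0:
        raise ValueError("installments_per_year must be positive")
    return round(installment_amount * installments_per_year, 2)


def _normalize(text: str) -> str:
    # Collapse whitespace and ignore case so formatting differences do not count as errors.
    return " ".join(text.split()).casefold()


def score_prediction(
    predicted: TaxCertificateFields,
    expected: TaxCertificateFields,
    tolerance: float = 0.01,  # assumed tolerance for the numeric field
) -> dict[str, bool]:
    """Return a per-field pass/fail map for one certificate."""
    return {
        # Semantic fields: exact match after light normalization.
        "year": predicted.year == expected.year,
        "parcel_number": _normalize(predicted.parcel_number)
        == _normalize(expected.parcel_number),
        "county": _normalize(predicted.county) == _normalize(expected.county),
        # Numerical field: match within the tolerance.
        "annualized_amount_due": abs(
            predicted.annualized_amount_due - expected.annualized_amount_due
        )
        <= tolerance,
    }
```

Scoring each field separately keeps semantic and numerical errors distinguishable, so a model that reads the parcel number correctly but miscomputes the yearly total is not counted as a complete failure.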

Model Performance