A specialized legal analysis benchmark developed with Jurisage that evaluates models' ability to analyze and reason about recent case law in family and criminal domains. The dataset uses private, post-training-cutoff cases from June 2024, testing models' capabilities in handling novel legal scenarios across US and Canadian jurisdictions. This benchmark is particularly valuable for assessing how models handle recent precedents and international legal systems, addressing a gap in legal LLM evaluation which has historically focused primarily on US law. The evaluation includes analysis of case patterns, precedent application, and legal reasoning across multiple jurisdictions.