This page contains press release content distributed by XPR Media. Members of the editorial and news staff of the USA TODAY Network were not involved in the creation of this content.

AI Built for Law Outperforms ChatGPT, Claude, and Gemini on Legal Reasoning Benchmark

DescrybeLM answered all 200 bar exam questions correctly. ChatGPT, Claude, and Gemini each missed between 13 and 23—and scored lower on legal reasoning quality.

We had a thesis that purpose-built legal AI produces meaningfully different results. Legal professionals deserve evidence. So we tested ourselves and published our methodology for anyone to replicate.”
— Kara Peterson, Co-Founder and CEO of Descrybe

BOSTON, MA, UNITED STATES, March 5, 2026 /EINPresswire.com/ — When AI gets a legal question wrong, the most dangerous failure isn’t an obvious error. It’s an answer that sounds authoritative: fluent, confident, well-structured, and yet applying the wrong legal standard. The error reads like competent lawyering.

Today, Descrybe launched DescrybeLM — an AI system built specifically for legal reasoning — and published a white paper with benchmark data to show what that difference looks like in practice.

Descrybe ran a controlled benchmark against ChatGPT 5.2, Claude Opus 4.5, and Gemini 3 Pro on 200 multistate bar exam questions. The study measured not just whether each system chose the correct answer, but whether the legal reasoning behind it was sound: Did it identify the right rule? Apply it correctly to the facts? Avoid the traps that produce persuasive but wrong analysis?

“We had a thesis that purpose-built legal AI produces meaningfully different results for legal reasoning tasks. Legal professionals deserve to make tool decisions based on real evidence. So we tested ourselves, published our methodology, and invite anyone to replicate it,” said Kara Peterson, Co-Founder and CEO of Descrybe.

What the benchmark showed

All four systems were tested under standardized, no-external-web conditions using the NCBE MBE Complete Practice Exam (Questions 1–200, no exclusions), producing 800 separate evaluation runs with blinded scoring.

When general-purpose models were wrong, they were confidently wrong. Among 52 incorrect outputs, 49 delivered assertive, well-structured reasoning that did not signal uncertainty — the failure mode that imposes the highest verification burden on practitioners. The dominant patterns were applying the wrong legal standard or misapplying the correct one, while the prose read like competent analysis.

Two models — Claude Opus 4.5 and Gemini 3 Pro — exhibited overconfident tone on correct outputs as well as incorrect ones. DescrybeLM and ChatGPT 5.2 received zero overconfidence flags across all 200 outputs. A system that sounds equally confident whether it is right or wrong gives practitioners no reliable signal from tone alone.

The study also found that cross-checking between general-purpose models is not a reliable substitute for getting the answer right. Across 200 questions, 40 were missed by at least one model, 11 by two or more, and only 1 by all three — meaning errors were largely unpredictable and non-overlapping.

What’s behind the results

DescrybeLM is built on a curated primary-law corpus of more than 100 million structured records, requiring more than 100 billion tokens of preparation.
“Most AI tools are built for general use and adapted for law. DescrybeLM was built differently: from the foundation up, specifically for legal reasoning, on more than 100 million structured records individually cleaned and organized for that purpose. That kind of data work is painstaking and takes years — but it’s the difference between a system that sounds right and one that is right,” said Richard DiBona, Co-Founder and CTO of Descrybe.

Why this matters

The headline problem in legal AI isn’t systems that obviously fail. It’s systems that fail invisibly, confidently, and in a way that reads like competent analysis. In a crowded market, sounding right is easy to mistake for being right. Legal professionals need real evidence to decide which tools to use for which purposes — which is why Descrybe published its methodology and invites independent replication.

“It’s rare to see something that genuinely stops you in your tracks. When I saw DescrybeLM answer all 200 multistate bar exam questions correctly while ChatGPT, Claude, and Gemini each missed double digits — that’s not a marginal difference. That’s a different category of tool,” said Ken Friedman, legal technology pioneer and advisor to Descrybe.

The full white paper, Beyond Confidently Wrong: How Purpose-Built AI Mitigates Legal Reasoning’s Hidden Risk, is available now.

Kara Peterson
Descrybe
+1 617-752-2020
email us here
Visit us on social media:
LinkedIn
YouTube

Descrybe demo

Legal Disclaimer:

EIN Presswire provides this news content “as is” without warranty of any kind. We do not accept any responsibility or liability
for the accuracy, content, images, videos, licenses, completeness, legality, or reliability of the information contained in this
article. If you have any complaints or copyright issues related to this article, kindly contact the author above.

Information contained on this page is provided by an independent third-party content provider. XPRMedia and this Site make no warranties or representations in connection therewith. If you are affiliated with this page and would like it removed please contact pressreleases@xpr.media

The Impact of Visual Media on Brand Authenticity in Digital Marketing Strategy by Actual SEO Media, Inc.

The Impact of Visual Media on Brand Authenticity in Digital Marketing Strategy by Actual SEO Media, Inc.

High-quality visual media serves as the primary driver of brand credibility, shaping consumer perceptions of

March 12, 2026

FrigoSense Unveils Patented AI ‘Digital Nose’ for Proactive Food Storage

FrigoSense Unveils Patented AI ‘Digital Nose’ for Proactive Food Storage

Patented IoT system shifts food safety from reactive detection to proactive prevention, using AI sensor fusion to

March 12, 2026

Rallied Launches AI Technician for MSPs That Resolves Tickets the Same Week

Rallied Launches AI Technician for MSPs That Resolves Tickets the Same Week

DENVER, CO, UNITED STATES, March 12, 2026 /EINPresswire.com/ — For most managed service providers, Tier 1 support

March 12, 2026

The Importance of ATEX: A Look at BelFone as a Global Leading Custom Radio Transceiver Manufacturer

The Importance of ATEX: A Look at BelFone as a Global Leading Custom Radio Transceiver Manufacturer

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — In the modern industrial landscape, the demand for

March 12, 2026

Buckhead Dental Partners Maintains Comprehensive Cosmetic Dental Services in Atlanta

Buckhead Dental Partners Maintains Comprehensive Cosmetic Dental Services in Atlanta

Buckhead Dental Partners in Atlanta continues offering preventive, restorative, and cosmetic dental care supported by

March 12, 2026

Why Skid-Mounted Loading Arms Improve Installation Speed and Flexibility

Why Skid-Mounted Loading Arms Improve Installation Speed and Flexibility

LIANYUNGANG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — As energy infrastructure projects worldwide face

March 12, 2026

Intelligent Loading Arms Support the Development of Unmanned Terminal Operations

Intelligent Loading Arms Support the Development of Unmanned Terminal Operations

LIANYUNGANG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — As the global oil, gas, and chemical industries

March 12, 2026

Hydraulic and Mechanical Mooring Hooks: Performance Comparison for Ports

Hydraulic and Mechanical Mooring Hooks: Performance Comparison for Ports

LIANYUNGANG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — In modern port operations, safe and efficient mooring

March 12, 2026

Global Ports Increasingly Adopt China-Built Ship-to-Shore Marine Loading Arms as Safety and Efficiency Standards Rise

Global Ports Increasingly Adopt China-Built Ship-to-Shore Marine Loading Arms as Safety and Efficiency Standards Rise

LIANYUNGANG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — As global maritime logistics becomes more demanding

March 12, 2026

Automatic Hardshell Rooftop Tents Reflect Evolving Trends in Overlanding Mobility

Automatic Hardshell Rooftop Tents Reflect Evolving Trends in Overlanding Mobility

XIAMEN, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — Modern overlanding is currently experiencing a profound

March 12, 2026

Ecer.com is Rewriting the Rules of Cross-Border Trade

Ecer.com is Rewriting the Rules of Cross-Border Trade

BEIJING, CHINA, CHINA, March 12, 2026 /EINPresswire.com/ — The landscape of international commerce is undergoing a

March 12, 2026

Instawork Wages Jump 12% as U.S. Jobs Market Cools

Instawork Wages Jump 12% as U.S. Jobs Market Cools

February Pay Index Shows Businesses Betting on Flexible Staffing to Hedge Against Market Uncertainty, While a Widening

March 12, 2026

Global Anti-Scam Alliance Launches Scam.org with OpenAI and Key Partners

Global Anti-Scam Alliance Launches Scam.org with OpenAI and Key Partners

AI technology meets on-the-ground expertise from leading organizations across five continents, accessible to billions

March 12, 2026

NJ Leaders and Creative Partners Launch Statewide Design Initiative for an Official State Jersey Ahead of World Cup

NJ Leaders and Creative Partners Launch Statewide Design Initiative for an Official State Jersey Ahead of World Cup

Project centers local designers, regional manufacturing, and public participation as NewJersey prepares to welcome the

March 12, 2026

Smack Dab Celebrates Every Season with Purpose-Driven Menus, Chicago Brunch Specials, and Holiday-Aligned Giveback

Smack Dab Celebrates Every Season with Purpose-Driven Menus, Chicago Brunch Specials, and Holiday-Aligned Giveback

Smack Dab celebrates every season with Chicago brunch specials, catering, and holiday givebacks, pairing seasonal menus

March 12, 2026

Antevia Networks and Benetel sign strategic partnership to accelerate scalable, mission-critical outdoor private 5G

Antevia Networks and Benetel sign strategic partnership to accelerate scalable, mission-critical outdoor private 5G

Partnership delivers simpler procurement, faster deployment and predictable private 5G performance READING, UNITED

March 12, 2026

APMG International Launches New ESG Certification to Support Responsible and Sustainable Business Practices

APMG International Launches New ESG Certification to Support Responsible and Sustainable Business Practices

This certification provides a structured way to build personal and organisational capability and embed responsible

March 12, 2026

IgA Nephropathy Foundation Launches Kidney Month Campaign Elevating Patient Voices and New Education Resources

IgA Nephropathy Foundation Launches Kidney Month Campaign Elevating Patient Voices and New Education Resources

The campaign features newly published research from Board members living with IgAN, alongside new educational resources

March 12, 2026

Strategic Guide: Selecting a China Professional DMR Radio Supplier with 37 Years’ Experience

Strategic Guide: Selecting a China Professional DMR Radio Supplier with 37 Years’ Experience

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — In the rapidly evolving landscape of critical

March 12, 2026

BelFone: A Trusted Leader in Professional UHF Radio Solutions with CE Certification

BelFone: A Trusted Leader in Professional UHF Radio Solutions with CE Certification

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — In the dynamic world of critical communications, stable

March 12, 2026

BelFone at Intersec: Showcasing Reliable Professional VHF Radio Solutions from China

BelFone at Intersec: Showcasing Reliable Professional VHF Radio Solutions from China

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The landscape of mission-critical communications is

March 12, 2026

Probing entanglement and parameter sensitivity in QAOA via Quantum Fisher Information

Probing entanglement and parameter sensitivity in QAOA via Quantum Fisher Information

GA, UNITED STATES, March 12, 2026 /EINPresswire.com/ — This article investigates Quantum Fisher Information (QFI) as a

March 12, 2026

Hola Prime Reinforces Its Trader-First Approach With The Zero Payout Denials Policy

Hola Prime Reinforces Its Trader-First Approach With The Zero Payout Denials Policy

With its Zero Payout Denials policy now live globally, Hola Prime strengthens payout integrity across accounts and

March 12, 2026

Performance Review: How a China Top 10 Professional Walkie Talkie Brand Compares in Digital Transitions

Performance Review: How a China Top 10 Professional Walkie Talkie Brand Compares in Digital Transitions

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The Digital Crossroads in Critical Communication The

March 12, 2026

A Buyer’s Guide to BelFone at PMR: Insights from a Global Leading Intelligent PoC Radio Company

A Buyer’s Guide to BelFone at PMR: Insights from a Global Leading Intelligent PoC Radio Company

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The professional mobile radio landscape is undergoing a

March 12, 2026

Industry Analysis: How BelFone Secured Its Status as a China Top 10 Handheld Radio Manufacturer

Industry Analysis: How BelFone Secured Its Status as a China Top 10 Handheld Radio Manufacturer

QUANZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The Renaissance of a Chinese Communication Giant In the

March 12, 2026

Top 5 Advantages of Partnering with a China flexible and resilient support Warp knitted interlining Manufacturer

Top 5 Advantages of Partnering with a China flexible and resilient support Warp knitted interlining Manufacturer

QIDONG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — As the global garment industry undergoes a structural

March 12, 2026

LEXIN Sets New Quality Benchmarks as a China professional Circular Knitted interlining Manufacturer

LEXIN Sets New Quality Benchmarks as a China professional Circular Knitted interlining Manufacturer

QIDONG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — Qidong LEXIN Textile Technology Co., Ltd., a recognized

March 12, 2026

Reliable Sourcing: LEXIN Provides Wholesale polyester interlining with OEKO-TEX certification

Reliable Sourcing: LEXIN Provides Wholesale polyester interlining with OEKO-TEX certification

QIDONG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — Qidong LEXIN Textile Technology Co., Ltd. has formally

March 12, 2026

VidAu Redefines Social E-Com in 2026: ‘VidRemake’ and ‘VidSnap’ to Transform Viral Hooks into High-Converting AI UGC

VidAu Redefines Social E-Com in 2026: ‘VidRemake’ and ‘VidSnap’ to Transform Viral Hooks into High-Converting AI UGC

Vidau.ai launches VidRemake and VidSnap, leveraging Sora 2 and Veo 3 to turn viral trends and single photos into

March 12, 2026

Inside SenCai: A Top 10 High Quality Bagasse Tableware Bulk in China for Eco-Conscious Brands

Inside SenCai: A Top 10 High Quality Bagasse Tableware Bulk in China for Eco-Conscious Brands

FUZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — In an era where environmental stewardship has transitioned

March 12, 2026

SenCai: A Top 10 China Stylish Kraft Gift Bag with Handles Manufacturer for Premium Retail Brands

SenCai: A Top 10 China Stylish Kraft Gift Bag with Handles Manufacturer for Premium Retail Brands

FUZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The Evolution of Premium Retail Packaging: Why Quality

March 12, 2026

A Complete Guide to Choosing the China Best Sugarcane Plates Wholesale Supplier: SenCai’ s Quality Commitment

A Complete Guide to Choosing the China Best Sugarcane Plates Wholesale Supplier: SenCai’ s Quality Commitment

FUZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — In the evolving landscape of sustainable food service, the

March 12, 2026

Janus Assurance Re Se Compromete con el Hogar Escuela de Niñas Doña Chucha

Janus Assurance Re Se Compromete con el Hogar Escuela de Niñas Doña Chucha

Janus Assurance Re realiza donación al Hogar Escuela de Niñas Doña Chucha y formaliza compromiso de apoyo permanente

March 12, 2026

Technical Comparison: Solutions from a Top Lightweight and Stable Support Garment Interlining Supplier

Technical Comparison: Solutions from a Top Lightweight and Stable Support Garment Interlining Supplier

QIDONG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — As the international apparel industry navigates a critical

March 12, 2026

Market Analysis: The Shift Toward the China Environmental Friendly Adhesive Interlining Manufacturer Model

Market Analysis: The Shift Toward the China Environmental Friendly Adhesive Interlining Manufacturer Model

QIDONG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — The global textile and apparel industry is currently

March 12, 2026

roiquant increased its pricing for the first time since its monetization in 2021

roiquant increased its pricing for the first time since its monetization in 2021

Price adjustment for roiquant subscription plans When our customers trust us fully, I truly believe that our business

March 12, 2026

Visit SenCai at the Upcoming Rolling Paper Expo: The Leading Wholesale Eco Rolling Papers Supplier from China

Visit SenCai at the Upcoming Rolling Paper Expo: The Leading Wholesale Eco Rolling Papers Supplier from China

FUZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The global smoking accessories market is undergoing a

March 12, 2026

How Does a Highly Cost-Effective Recycled Polyester Interlining Manufacturer Reduce Environmental Impact

How Does a Highly Cost-Effective Recycled Polyester Interlining Manufacturer Reduce Environmental Impact

QIDONG, JIANGSU, CHINA, March 12, 2026 /EINPresswire.com/ — The global garment industry is currently undergoing a

March 12, 2026

From Fujian to the Global Stage: SenCai’s Strategic Growth as a China Top 10 Takeaway Packaging Design Company

From Fujian to the Global Stage: SenCai’s Strategic Growth as a China Top 10 Takeaway Packaging Design Company

FUZHOU, FUJIAN, CHINA, March 12, 2026 /EINPresswire.com/ — The New Face of Chinese Manufacturing In the contemporary

March 12, 2026