When a company's thesis rests on AI/ML-driven operational superiority — better pricing, lower loss rates, reduced unit costs — the claim must be validated by multi-quarter improvement in the domain's primary observable metric. Narrative without metric confirmation is thesis-unconfirmed, regardless of management credibility, product roadmap detail, or anecdotal customer evidence.
The failure mode: a management team can be highly credible (high promise-delivery rate, strong track record) and still make AI advantage claims that the numbers do not yet support. Credibility and thesis validity are decoupled. When the claimed advantage metric is moving the wrong direction for 3+ consecutive quarters, the default posture should be "unproven" until the metric inflects.
Domain-specific observable metric checks for AI advantage claims:
| Domain | AI Claim Type | Primary Validation Metric |
|---|---|---|
| P&C insurance | Better risk pricing | Accident-period loss ratio (APLR) — should trend down |
| Consumer/SMB lending | Smarter credit underwriting | Net charge-off rate / NPL ratio — should trend down |
| SaaS with AI efficiency | Reduced engineering or support cost | R&D or G&A as % of revenue — should compress |
| Manufacturing AI | Process optimisation | Cost per unit / defect rate — should decline |
| Ad-tech AI | Better targeting = higher ROAS | CPM yield or ARPU per DAU — should rise |
When the primary metric is moving against the claimed advantage for 2+ quarters, raise the evidence bar before sizing a position. "The AI is working but it's early" is a valid hypothesis — not a valid investment thesis.