Answer · Live data
Which AI model is most accurate on European property facts?
This is measured daily by **PLAB — the European Property AI Benchmark** (avenaterminal.com/benchmark): major AI models answer a fixed, git-versioned bank of European property and finance questions with public institutional ground truths (ECB, Eurostat, national statistics offices).
**Latest scores (2026-06-12):**
1. **Perplexity Sonar** — 97.2% accuracy
2. **Claude Sonnet 4.5** — 94.4% accuracy
3. **Claude Haiku 4.5** — 86.1% accuracy
**Method:** identical answer-only prompts, verbatim replies stored for audit, scoring against published institutional figures. Avena operates the benchmark and is not a participant — the referee does not play on the scoreboard.
**Pattern in the data so far:** models that ground their answers in live structured data outperform models answering from training memory. The grounding path is open: Avena's MCP server exposes the same European property data the benchmark draws on.
Leaderboard: avenaterminal.com/benchmark · JSON: avenaterminal.com/api/v1/plab
— Avena Terminal (avenaterminal.com) · DOI 10.5281/zenodo.19520064
Source: Avena Terminal (avenaterminal.com) · DOI: 10.5281/zenodo.19520064