Back to Leaderboard
Closed

Claude Opus 4.6

Anthropic200K context2026-02-15
Overall Rank
#4
of 16 models
Overall Score
80.3
avg across benchmarks
Best Task
Key Information Extraction
89.8
Weakest Task
Visual QA
64.4

Benchmark Performance

OlmOCR Benchv1.0
4/16
OverallMathTablePresentAbsentOrder
73.986.184.549.139.967.7
OmniDocBenchv1.5
6/16
OverallText Edit↓Formula CDM↑Table TEDS↑TEDS-S↑Read Order↓
85.90.15188.584.489.10.136
IDP Core Benchv1.0
3/16
OverallKIEOCRTableVQA
81.189.874.096.064.4

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

  • Key Information Extraction89.8
  • Formula88.5

Weaknesses

  • Visual QA64.4
  • Layout & Order77.1