GPT-5.4

OpenAI400K context2026-03-05
Overall Rank
#4
of 18 models
Overall Score
81.0
avg across benchmarks
Best Task
Text Extraction
91.1
Weakest Task
Visual QA
78.2
Benchmark Performance
OlmOCR Benchv1.0
7/18
| Overall | Math | Table | Present | Absent | Order |
|---|---|---|---|---|---|
| 73.4 | 83.1 | 91.1 | 66.9 | 25.2 | 74.7 |
OmniDocBenchv1.5
9/18
| Overall | Text Edit↓ | CDM↑ | TEDS↑ | TEDS-S↑ | Read Order↓ |
|---|---|---|---|---|---|
| 85.3 | 0.089 | 83.4 | 81.3 | 86.7 | 0.077 |
IDP Core Benchv1.0
2/18
| Overall | KIE | OCR | Table | VQA |
|---|---|---|---|---|
| 84.4 | 85.7 | 69.1 | 94.8 | 78.2 |
Capability Profile
Strength Analysis
Auto-generated from benchmark scores
Strengths
- Text Extraction91.1
- Table Understanding89.1
Weaknesses
- Visual QA78.2
- Formula83.4