Back to Leaderboard
Closed

GPT-4.1

OpenAI1048K context$2/1K pages2025-04-14
Overall Rank
#8
of 16 models
Overall Score
70.0
avg across benchmarks
Best Task
Key Information Extraction
87.1
Weakest Task
Visual QA
63.0

Benchmark Performance

OlmOCR Benchv1.0
12/16
OverallMathTablePresentAbsentOrder
55.560.059.147.334.959.4
OmniDocBenchv1.5
9/16
OverallText Edit↓Formula CDM↑Table TEDS↑TEDS-S↑Read Order↓
79.90.16782.274.083.80.115
IDP Core Benchv1.0
6/16
OverallKIEOCRTableVQA
74.787.175.673.163.0

Capability Profile

Strength Analysis

Auto-generated from benchmark scores

Strengths

  • Key Information Extraction87.1
  • Text Extraction83.3

Weaknesses

  • Visual QA63.0
  • Table Understanding68.7