Intelligent Document Processing Leaderboard

Comprehensive document AI leaderboard comparing the best models across OCR, table extraction, key information extraction, and visual QA. Compare performance, accuracy, and cost.

0 Benchmarks0 ModelsOpen evaluation
#
Model
Overall
OlmOCR
OmniDoc
IDP
1Nanonets OCR-3Nanonets85.987.490.080.2
2GPT-5.4OpenAI83.581.085.384.4
3Gemini-3-ProGoogle82.877.788.881.8
4Gemini-3-FlashGoogle82.075.390.180.5
5Nanonets OCR2+Nanonets81.882.089.573.8
6Gemini 3.1 ProGoogle81.669.885.389.6
7GPT-5.2OpenAI81.579.188.077.4
8Claude Sonnet 4.6Anthropic80.773.986.981.2
9Claude Opus 4.6Anthropic80.474.185.981.1
10Qwen3-VL-PlusAlibaba80.177.982.579.8
11Qwen3-VL-235BAlibaba79.676.881.980.0
12Qwen3.5-9BAlibaba76.777.276.776.2
13GPT-5-MiniOpenAI75.269.982.573.3
14Qwen3.5-4BAlibaba72.575.467.674.5
15Mistral Small 4Mistral AI71.569.676.468.5
16Claude Haiku 4.5Anthropic71.261.279.672.9
17Ministral-8BMistral AI69.558.778.371.7
18GPT-4.1OpenAI68.049.479.974.7
19GLM-OCRZhipu AI64.268.469.254.9
20Qwen3.5-2BAlibaba62.671.948.767.1
21Qwen3.5-0.8BAlibaba57.864.847.361.2
22GPT-5-NanoOpenAI54.835.263.465.8
23Gemma-4-E4B-itGoogle53.947.059.755.0
24Llama-3.2-Vision-11BMeta50.849.144.658.6
25Pixtral-12BMistral AI46.538.342.359.0
26Gemma-4-E2B-itGoogle41.938.243.344.1

About the Leaderboard

The Intelligent Document Processing (IDP) Leaderboard provides a comprehensive evaluation framework for assessing the capabilities of various AI models in document understanding and processing tasks. The overall score is the mean of all benchmark scores.