Intelligent Document Processing Leaderboard

Comprehensive document AI leaderboard comparing the best models across OCR, table extraction, key information extraction, and visual QA. Compare performance, accuracy, and cost.

Open evaluation
 #  Model                 Provider    Overall  OlmOCR  OmniDoc    IDP
 1  Nanonets OCR-3        Nanonets       84.4    83.1     89.9   80.2
 2  Gemini 3.1 Pro        Google         83.2    74.6     85.3   89.6
 3  Nanonets OCR2+        Nanonets       81.8    82.2     89.5   73.8
 4  Gemini-3-Pro          Google         81.4    73.5     88.8   81.8
 5  GPT-5.4               OpenAI         81.0    73.4     85.3   84.4
 6  Claude Sonnet 4.6     Anthropic      80.8    74.4     86.9   81.2
 7  Claude Opus 4.6       Anthropic      80.3    73.9     85.9   81.1
 8  Gemini-3-Flash        Google         79.9    69.2     90.1   80.5
 9  GPT-5.2               OpenAI         79.2    72.2     88.0   77.4
10  Qwen3.5-9B            Alibaba        77.0    78.1     76.7   76.2
11  Qwen3.5-4B            Alibaba        73.1    77.2     67.6   74.5
12  Mistral Small 4       Mistral AI     71.5    69.6     76.4   68.5
13  GPT-5-Mini            OpenAI         70.8    56.7     82.5   73.3
14  GPT-4.1               OpenAI         70.0    55.5     79.9   74.7
15  Claude Haiku 4.5      Anthropic      69.6    56.2     79.6   72.9
16  Ministral-8B          Mistral AI     69.3    57.8     78.3   71.7
17  GLM-OCR               Zhipu AI       63.6    66.7     69.2   54.9
18  Qwen3.5-2B            Alibaba        63.2    73.7     48.7   67.1
19  Qwen3.5-0.8B          Alibaba        58.0    65.6     47.3   61.2
20  GPT-5-Nano            OpenAI         50.7    22.8     63.4   65.8
21  Llama-3.2-Vision-11B  Meta           50.1    47.2     44.6   58.6
22  Pixtral-12B           Mistral AI     46.0    36.8     42.3   59.0

About the Leaderboard

The Intelligent Document Processing (IDP) Leaderboard evaluates how well AI models understand and process documents. Each model's Overall score is the unweighted mean of its three benchmark scores: OlmOCR, OmniDoc, and IDP.
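As a minimal sketch of that averaging, the snippet below recomputes the Overall score for three rows of the table. The function name `overall` and the rounding to one decimal place are assumptions made for illustration; the leaderboard does not state how it rounds.

```python
from statistics import mean

# Per-benchmark scores copied from the leaderboard table,
# in the order (OlmOCR, OmniDoc, IDP).
scores = {
    "Nanonets OCR-3": (83.1, 89.9, 80.2),
    "Gemini 3.1 Pro": (74.6, 85.3, 89.6),
    "GPT-5-Nano": (22.8, 63.4, 65.8),
}


def overall(benchmark_scores):
    """Unweighted mean of the benchmark scores.

    Rounding to one decimal is an assumption chosen to match
    the precision shown in the table.
    """
    return round(mean(benchmark_scores), 1)


for model, per_benchmark in scores.items():
    print(f"{model}: {overall(per_benchmark)}")
# Nanonets OCR-3: 84.4
# Gemini 3.1 Pro: 83.2
# GPT-5-Nano: 50.7
```

Because the mean is unweighted, one weak benchmark pulls the Overall score down as much as a strong one lifts it. GPT-5-Nano's 22.8 on OlmOCR, for example, leaves it at 50.7 overall despite scores above 63 on the other two benchmarks.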