Intelligent Document Processing Leaderboard
Comprehensive document AI leaderboard comparing the best models across OCR, table extraction, key information extraction, and visual QA. Compare performance, accuracy, and cost.
3 Benchmarks · 24 Models · Open evaluation
| # | Model | Provider | Overall | OlmOCR | OmniDoc | IDP |
|---|---|---|---|---|---|---|
| 1 | Nanonets OCR-3 | Nanonets | 85.9 | 87.4 | 90.0 | 80.2 |
| 2 | Nanonets OCR2+ | Nanonets | 81.8 | 82.0 | 89.5 | 73.8 |
| 3 | GPT-5.4 | OpenAI | 81.0 | 73.4 | 85.3 | 84.4 |
| 4 | Qwen3-VL-Plus | Alibaba | 80.1 | 77.9 | 82.5 | 79.8 |
| 5 | Qwen3-VL-235B | Alibaba | 79.6 | 76.8 | 81.9 | 80.0 |
| 6 | Gemini-3-Pro | Google | 79.4 | 67.7 | 88.8 | 81.8 |
| 7 | Claude Sonnet 4.6 | Anthropic | 79.1 | 69.3 | 86.9 | 81.2 |
| 8 | Claude Opus 4.6 | Anthropic | 78.8 | 69.3 | 85.9 | 81.1 |
| 9 | Gemini-3-Flash | Google | 78.6 | 65.3 | 90.1 | 80.5 |
| 10 | Gemini 3.1 Pro | Google | 78.5 | 60.7 | 85.3 | 89.6 |
| 11 | GPT-5.2 | OpenAI | 78.0 | 68.7 | 88.0 | 77.4 |
| 12 | Qwen3.5-9B | Alibaba | 76.7 | 77.2 | 76.7 | 76.2 |
| 13 | Qwen3.5-4B | Alibaba | 72.5 | 75.4 | 67.6 | 74.5 |
| 14 | GPT-5-Mini | OpenAI | 71.7 | 59.3 | 82.5 | 73.3 |
| 15 | Mistral Small 4 | Mistral AI | 71.5 | 69.6 | 76.4 | 68.5 |
| 16 | Claude Haiku 4.5 | Anthropic | 70.2 | 58.2 | 79.6 | 72.9 |
| 17 | Ministral-8B | Mistral AI | 69.5 | 58.7 | 78.3 | 71.7 |
| 18 | GPT-4.1 | OpenAI | 69.5 | 54.0 | 79.9 | 74.7 |
| 19 | GLM-OCR | Zhipu AI | 64.2 | 68.4 | 69.2 | 54.9 |
| 20 | Qwen3.5-2B | Alibaba | 62.6 | 71.9 | 48.7 | 67.1 |
| 21 | Qwen3.5-0.8B | Alibaba | 57.8 | 64.8 | 47.3 | 61.2 |
| 22 | GPT-5-Nano | OpenAI | 52.0 | 26.8 | 63.4 | 65.8 |
| 23 | Llama-3.2-Vision-11B | Meta | 50.8 | 49.1 | 44.6 | 58.6 |
| 24 | Pixtral-12B | Mistral AI | 46.5 | 38.3 | 42.3 | 59.0 |
About the Leaderboard
The Intelligent Document Processing (IDP) Leaderboard provides a comprehensive evaluation framework for assessing the capabilities of AI models on document understanding and processing tasks. Each model's overall score is the unweighted mean of its three benchmark scores (OlmOCR, OmniDoc, and IDP).
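The scoring rule above can be sketched as a one-line computation. This is an illustrative snippet (the function name and the one-decimal rounding convention are assumptions inferred from the table's formatting, not an official scoring script):

```python
def overall_score(olmocr: float, omnidoc: float, idp: float) -> float:
    """Unweighted mean of the three benchmark scores, shown to one decimal."""
    return round((olmocr + omnidoc + idp) / 3, 1)

# Example using the top-ranked entry from the table:
# Nanonets OCR-3 scores 87.4 (OlmOCR), 90.0 (OmniDoc), 80.2 (IDP)
print(overall_score(87.4, 90.0, 80.2))  # 85.9, matching its Overall column
```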