<img width="800" src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/main/images/paddleocr/README/Banner.png" alt="PaddleOCR Banner">
English | 简体中文 | 繁體中文 | 日本語 | 한국어 | Français | Русский | Español | العربية
PaddleOCR converts PDF documents and images into structured, LLM-ready data (JSON/Markdown) with industry-leading accuracy. With 70k+ Stars and trusted by top-tier projects like Dify, RAGFlow, and Cherry Studio, PaddleOCR is the bedrock for building intelligent RAG and Agentic applications.
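To make "structured, LLM-ready data" concrete, here is a minimal sketch of how parsed document blocks could be serialized to both JSON and Markdown. It does not call PaddleOCR: the `blocks` list, its field names (`type`, `level`, `text`, `rows`), and the helper functions are hypothetical and only illustrate the idea, not PaddleOCR's actual output schema.

```python
import json

# Hypothetical parsed blocks. The field names are illustrative assumptions,
# not PaddleOCR's real result format.
blocks = [
    {"type": "title", "level": 1, "text": "Quarterly Report"},
    {"type": "paragraph", "text": "Revenue grew in Q3."},
    {"type": "table", "rows": [["Region", "Revenue"], ["EMEA", "1.2M"]]},
]


def table_to_markdown(rows):
    """Render rows as a Markdown table; the first row is treated as the header."""
    header, *body = rows
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join("---" for _ in header) + " |",
    ]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines)


def blocks_to_markdown(blocks):
    """Join blocks into one Markdown document, separated by blank lines."""
    parts = []
    for block in blocks:
        if block["type"] == "title":
            parts.append("#" * block.get("level", 1) + " " + block["text"])
        elif block["type"] == "table":
            parts.append(table_to_markdown(block["rows"]))
        else:
            # Fallback: treat any other block as plain paragraph text.
            parts.append(block["text"])
    return "\n\n".join(parts)


markdown = blocks_to_markdown(blocks)
as_json = json.dumps(blocks, ensure_ascii=False, indent=2)
print(markdown)
```

Either representation can be passed to an LLM or a RAG indexer. Markdown keeps headings and tables readable in a prompt, while JSON preserves the block types for programmatic filtering.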
Transforming messy visuals into structured data for the LLM era.
The global gold standard for high-speed, multilingual text spotting.
Performance Leap: PP-OCRv6 improves detection accuracy by 4.6% and recognition accuracy by 5.1% over PP-OCRv5, surpassing mainstream Vision-Language Models, and delivers a 5.2× end-to-end CPU inference speedup.
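As a rough guide to what a 5.2× speedup means in practice, the sketch below converts a speedup factor into a per-image latency and a fractional time saving. The 520 ms baseline is a made-up number for illustration, not a measured PP-OCRv5 result, and the function names are hypothetical.

```python
def latency_after_speedup(baseline_ms: float, speedup: float) -> float:
    """Latency after applying a speedup factor (new = old / factor)."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return baseline_ms / speedup


def time_saved_fraction(speedup: float) -> float:
    """Fraction of the original runtime that is eliminated: 1 - 1/speedup."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return 1.0 - 1.0 / speedup


# Hypothetical baseline latency, for illustration only.
baseline_ms = 520.0
speedup = 5.2

new_latency_ms = latency_after_speedup(baseline_ms, speedup)
saved = time_saved_fraction(speedup)

print(f"latency: {baseline_ms:.0f} ms -> {new_latency_ms:.0f} ms")
print(f"time saved: {saved:.1%}")
```

A 5.2× speedup removes roughly 81% of the original runtime, whatever the baseline is. The absolute latency depends on your hardware and input images.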

PP-OCRv6 highlights:
2026.05.28: Release of PaddleOCR 3.6.0
PaddleOCR-VL-1.6 highlights:
2026.04.21: Release of PaddleOCR 3.5.0
The PaddleOCR-VL series, PP-StructureV3, and PP-DocTranslation now support exporting parsed results to DOCX for convenient viewing and editing in Microsoft Word.
PaddleOCR.js, the official browser inference SDK, supports running PP-OCRv5 directly in the browser.
2026.01.29: Release of PaddleOCR 3.4.0
2025.10.16: Release of PaddleOCR 3.3.0
Released PaddleOCR-VL:
Model Introduction:
Core Features:
Released PP-OCRv5 Multilingual Recognition Model:
2025.08.21: Release of PaddleOCR 3.2.0
$ claude mcp add PaddleOCR \
-- python -m otcore.mcp_server <graph>