Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
-
Updated
May 13, 2026 - Python
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
A lightweight, type-safe, PaddlePaddle PP-DocLayoutV3 & V2 implementation in Bun/Node.js for document layout analysis in JavaScript environments.
Add a description, image, and links to the doclayout topic page so that developers can more easily learn about it.
To associate your repository with the doclayout topic, visit your repo's landing page and select "manage topics."