Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
-
PaddlePaddle/PaddleOCR-VL-1.5
Image-Text-to-Text β’ 1.0B β’ Updated β’ 22.2k β’ 471 -
PaddleOCR-VL-1.5 Online Demo
π»66PaddleOCR-VL-1.5_Online_Demo
-
PaddlePaddle/PP-DocLayoutV3
Image Segmentation β’ Updated β’ 16.2k β’ 55 -
PaddlePaddle/PP-DocLayoutV3_safetensors
Object Detection β’ Updated β’ 212k β’ 19