SmolDocling: An ultra-compact VLM for end-to-end multi-modal document conversion
(arxiv.org)
We introduce SmolDocling, an ultra-compact vision-language model targeting end-to-end document conversion.
We introduce SmolDocling, an ultra-compact vision-language model targeting end-to-end document conversion.