SmolDocling

256M VLM for end-to-end document AI

SmolDocling, from Hugging Face and IBM Research, is an ultra-compact (256M-parameter) open vision-language model (VLM) for end-to-end document conversion. It extracts text, layout, tables, code, and more from images.
