Search papers, labs, and topics across Lattice.
2
16
5
5
Layout-preserving text beats pixel-level visual cues for structured data extraction from documents, according to a new benchmark spanning 1,771 unique schemas.
A new 2B parameter vision-language model, Granite Vision, rivals larger models on visual document understanding tasks while offering a transparent and commercially-friendly open-source license.