VLMs can now improve their chart-understanding abilities thanks to a new million-scale dataset with paired code, images, data, and reasoning.
LLM-based ASR can be sped up by 4.4x with minimal accuracy loss by using a CTC encoder to speculatively generate draft transcriptions.
A new 2B-parameter vision-language model, Granite Vision, rivals larger models on visual document understanding tasks while offering a transparent, commercially friendly open-source license.