Search papers, labs, and topics across Lattice.
2
0
5
BabelRS disentangles modality alignment from downstream task learning in multi-modal remote sensing, leading to more stable training and improved detection accuracy.
Forget incomplete video descriptions: a new dataset and training pipeline yields video captioning models that are competitive with Gemini-3-Pro while reducing hallucinations.