Search papers, labs, and topics across Lattice.
4
0
4
No existing model can effectively ground the spatial structure of student reasoning in multi-page handwritten homework, revealing a significant gap in automated assessment capabilities.
PreciseDoc achieves unprecedented precision in grounding critical document elements, transforming how LMMs can interpret complex text-rich environments.
Preserving skill-level attention structures in MLLMs can dramatically reduce forgetting while adapting to new tasks without relying on replay mechanisms.
Multimodal perception is no longer just an add-on: GLM-5V-Turbo bakes it directly into the core of reasoning, planning, and action.