A 4B-parameter model, InternVL-U, punches above its weight, outperforming 14B-parameter models in multimodal generation and editing by using a novel data synthesis pipeline and architecture.
Forget hand-annotated 3D datasets: a new automated pipeline generates large-scale, high-quality 3D spatial intelligence data from raw video, unlocking better VLM reasoning.