Search papers, labs, and topics across Lattice.
Shanghai AI Laboratory, University of Science and Technology of China
2
0
6
4
A 4B-parameter model, InternVL-U, outperforms 14B-parameter models in multimodal generation and editing, proving that size isn't everything.
Spatial reasoning could be the secret sauce for building generalist embodied agents that can drive, manipulate objects, and fly drones, all within a single model.