Fudan University, Shanghai Innovation Institute ∗Equal Contribution †Project Lead
Gradual bridging with embodied trajectory-coupled data transforms VLMs into robust robot control policies, overcoming significant transfer challenges.
One-step action generation in VLA models can outperform ten-step methods by simply biasing training towards high-noise states, challenging the need for complex iterative processes.
MOSS-Audio achieves state-of-the-art performance on audio understanding tasks by integrating temporal cues with deep acoustic features.