Search papers, labs, and topics across Lattice.
2
0
4
MOSS-Audio achieves state-of-the-art performance in audio understanding tasks by effectively integrating temporal cues and deep acoustic features, setting a new benchmark for audio-language models.
VLAs learn to predict task success even when trained only with imitation learning, opening the door to improved performance without expensive reward engineering.