Search papers, labs, and topics across Lattice.
2
0
3
MOSS-Audio achieves state-of-the-art performance in audio understanding tasks by effectively integrating temporal cues and deep acoustic features, setting a new benchmark for audio-language models.
Cinematic speech data unlocks more realistic and controllable voice generation from natural language descriptions.