By decoupling patch details from semantics, Cheers achieves state-of-the-art multimodal understanding and generation with 4x token compression and only 20% of the training cost of comparable models.
Time-shifted anechoic speech beats early reflections as a training target for universal speech enhancement, leading to better perceptual quality and ASR performance.