Search papers, labs, and topics across Lattice.
Universiti Sains Malaysia
2
0
5
Discriminative readout from an omni-modal LLM slashes MSA error rates and latency, outperforming traditional generative methods by a wide margin.
Achieve real-time, high-fidelity speech synthesis with a 48ms latency by directly generating compressed audio codec tokens, bypassing the neural vocoder bottleneck.