Achieve faster, more accurate turn-taking in spoken dialogue by fusing streaming speech recognition with raw audio cues, even in noisy conditions.
A fully open-source speech understanding model, OSUM-Pangu, proves that competitive performance is achievable on non-CUDA hardware, challenging the dominance of GPU-centric ecosystems.