Search papers, labs, and topics across Lattice.
IBM Research
3
0
3
Forget phoneme sequences and G2P systems: this work shows you can boost ASR accuracy for rare words by cleverly leveraging acoustic cues from common words with similar sounds.
Speaker-attributed ASR gets a serious boost from jointly training speaker cluster tags within a speech-aware LLM, outperforming traditional pipelines.
Ditch slow, sequential decoding: NLE achieves 27x speedup over autoregressive ASR by using a non-autoregressive, LLM-based transcript editing approach.