IBM Research
Speaker-attributed ASR gets a serious boost from jointly training speaker cluster tags within a speech-aware LLM, outperforming traditional pipelines.
Despite advances in expressive speech, current TTS systems often miss subtle but crucial contextual cues, failing to emphasize the correct words even when the context makes the intended meaning clear.