Applying LLM reasoning to speech points to a new ASR paradigm: having the model "think" before transcribing cuts error rates by up to 17%.
Agent systems leveraging iterative tool orchestration and cross-modal analysis significantly outperform single models in audio reasoning, highlighting a promising path toward explainable audio intelligence.