Search papers, labs, and topics across Lattice.
2
0
5
14
Achieve near-perfect speech recognition at a ridiculously low 200 bits per second by using reinforcement learning to directly optimize a neural codec for intelligibility.
By explicitly modeling 3D space with learned spatial audio representations, JAEGER enables AV-LLMs to perform joint spatial grounding and reasoning far beyond the capabilities of 2D-centric models.