Search papers, labs, and topics across Lattice.
3
0
6
23
RLVR, the dominant training paradigm for audio language models, may be turning them into unfeeling "answering machines" that excel on benchmarks but fail the vibe check.
Mimicking human cognition, FLAIR lets dialogue models "think while listening," boosting performance without adding latency.
Turns out your always-on speech dialogue model is leaking speaker identity like a sieve, but a simple feature-domain anonymization technique can boost privacy by 3.5x with minimal impact on performance.