Search papers, labs, and topics across Lattice.
2
0
4
17
Just 4 hours of speech data closes the modality gap in LLM-based ASR, rivaling full-dataset fine-tuning and unlocking effective domain adaptation.
LLM-based ASR can get a context boost without the compute cost: compress prior audio turns into learned latent tokens and retain transcripts to recover accuracy while shrinking the audio footprint.