Search papers, labs, and topics across Lattice.
3
0
7
0
Forget finetuning: TTA-Vid adapts video reasoning models to new datasets *during inference* using test-time reinforcement learning, achieving state-of-the-art results without any labels.
LLM-based ASR can be sped up by 4.4x with minimal accuracy loss by using a CTC encoder to speculatively generate draft transcriptions.
Ditch slow, sequential decoding: NLE achieves 27x speedup over autoregressive ASR by using a non-autoregressive, LLM-based transcript editing approach.