Search papers, labs, and topics across Lattice.
Indiana University
2
0
5
Text-based speculative decoding falls flat for vision-language models, but ViSkip dynamically adapts to vision tokens for state-of-the-art acceleration.
LLMs can perfectly cluster speakers in overlapping multi-party conversations, enabling near-perfect Joint ASR-Clustering Error Rate in challenging CHiME-9 tasks.