Search papers, labs, and topics across Lattice.
Independent
1
0
3
Text-based speculative decoding falls flat for vision-language models, but ViSkip dynamically adapts to vision tokens for state-of-the-art acceleration.