Search papers, labs, and topics across Lattice.
OPPO AI Center
2
0
5
Gradient explosions in token-level distillation are tamed by a novel normalization technique, leading to robust improvements in multimodal reasoning tasks.
VLMs can achieve up to 4.2x faster inference by simply skipping redundant pixels before they even enter the Vision Transformer.