Search papers, labs, and topics across Lattice.
Huawei 2012 Labs, Hefei, China
1
0
3
2
Freezing most weights and only LoRA-tuning a vision-language model achieves near state-of-the-art multimodal interleaved reasoning performance, proving that targeted adaptation can rival full fine-tuning.