Department of Data Science, Information System, College of Computer Science and Technology, City University of Hong Kong (Hong Kong), Zhejiang University of Technology (Zhejiang), City University of Hong Kong, Baidu Inc
A lightweight architecture that distills long textual sequences using visual tokens as dynamic queries boosts LLM performance on 2D table understanding by 23.9%.
Forget independent feature extraction: a new architecture uses LVLMs to explicitly model the relationships between drone and satellite imagery, substantially boosting geolocalization accuracy.