Search papers, labs, and topics across Lattice.
[, Department of Data Science, Information System, College of Computer Science and Technology, City University of Hong Kong Hong Kong, Zhejiang University of Technology Zhejiang
2
0
4
A lightweight architecture that distills long textual sequences using visual tokens as dynamic queries boosts LLM performance on 2D table understanding by 23.9%.
Forget independent feature extraction: a new architecture uses LVLMs to explicitly model the relationships between drone and satellite imagery, substantially boosting geolocalization accuracy.