Search papers, labs, and topics across Lattice.
Department of Data Science, Information System, College of Computer Science and Technology, City University of Hong Kong Hong Kong, Zhejiang University of Technology Zhejiang, City University of Hong Kong
3
0
4
A lightweight architecture that distills long textual sequences using visual tokens as dynamic queries boosts LLM performance on 2D table understanding by 23.9%.
LLMs can now retrieve memories like humans, using a fast familiarity check or a deliberate recollection process, leading to better personalization without overwhelming the model with irrelevant context.
Forget independent feature extraction: a new architecture uses LVLMs to explicitly model the relationships between drone and satellite imagery, substantially boosting geolocalization accuracy.