Search papers, labs, and topics across Lattice.
College of Computer Science and Technology, Zhejiang University, State Key Laboratory of Blockchain and Security, Zhejiang University
4
0
8
Pruning detrimental LoRA modules can lead to substantial performance gains in multi-task models, challenging the assumption that all components contribute positively.
DriveFix tackles the "shaky camera" problem in 4D driving scene reconstruction, producing significantly more stable and coherent novel views by explicitly modeling spatio-temporal dependencies.
Achieve diffusion-level perceptual quality in monocular depth estimation at 40x the speed, by replacing the slow initial diffusion steps with a fast ViT-based depth map and refining in a compact latent space.
VLMs still can't reason about spatial logic in real-world scenes, but a new benchmark and scene graph method shows how to make progress.