Search papers, labs, and topics across Lattice.
Zhejiang University
1
0
3
Achieve diffusion-level perceptual quality in monocular depth estimation at 40x the speed, by replacing the slow initial diffusion steps with a fast ViT-based depth map and refining in a compact latent space.