Zhejiang University, Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security
Achieve diffusion-level perceptual quality in monocular depth estimation at 40x the speed, by replacing the slow initial diffusion steps with a fast ViT-based depth map and refining in a compact latent space.
Achieve faithful textile pattern generation by disentangling clothing features and guiding a diffusion model with fine-grained alignment, outperforming existing image-to-image methods.