Jun-Yan Zhu

Research focus

Computer Vision (2), Multimodal Models (2), Reasoning & Chain-of-Thought (1)

Frequent co-authors

Jiachun Jin (1), Zetong Zhou (1), Xiao Yang (1), Hao Zhang (1)

Papers (2)

Jiachun Jin +7 · Apr 2, 2026

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

By dropping pixel-space translation, the unified model LatentUM reasons across modalities in latent space and reports state-of-the-art results, pointing toward more efficient and better-aligned visual AI.

Jiachun Jin, Zetong Zhou, Xiao Yang +5

Computer Vision, Multimodal Models, Reasoning & Chain-of-Thought

CMU ML · Jan 2, 2025 · also Snap Research, TAU

Object-level Visual Prompts for Compositional Image Generation

Achieve semantically coherent image compositions by mixing layout-focused and appearance-focused visual representations in a diffusion model's cross-attention.

Gaurav Parmar, Or Patashnik, K. Wang +5 · 15

Computer Vision, Multimodal Models
