Hengshuang Zhao

Corresponding author

Papers on Lattice

Total citations

Topics

h-index

Research focus

Computer Vision (2) · Multimodal Models (2) · Architecture Design (Transformers, SSMs, MoE) (1) · Robotics & Embodied AI (1) · Training Efficiency & Optimization (1)

Frequent co-authors

Yujia Zhang (1) · Xiaoyang Wu (1) · Xianzhe Fan (1) · Han Li (1)

Papers (3)

Mar 3, 2026

Mar 3, 2026·also Corresponding author, V

Utonia: Toward One Encoder for All Point Clouds

Training a single point cloud encoder across diverse 3D domains not only improves perception but also unlocks emergent behaviors and enhances robotic manipulation and spatial reasoning.

Yujia Zhang, Xiaoyang Wu, Xianzhe Fan +5

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Multimodal Models

Mar 3, 2026·also ACE Robotics, Corresponding author, NTU, SJTU +2

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Spatial reasoning could be the secret sauce for building generalist embodied agents that can drive, manipulate objects, and fly drones, all within a single model.

Ziyang Gong, Zehang Luo, An-Liu Tang +20

Robotics & Embodied AI · Training Efficiency & Optimization · World Models & Planning

Aug 15, 2025

Aug 15, 2025·also Corresponding author

Generalized Decoupled Learning for Enhancing Open-Vocabulary Dense Perception

CLIP's image tokens struggle to aggregate information from spatially or semantically related regions; DeCLIP addresses this by decoupling self-attention and distilling knowledge from vision foundation models (VFMs) and diffusion models.

Junjie Wang, Keyu Chen, Yulin Li +4

Computer Vision · Multimodal Models
