Zhuoyang Zhang

Research focus

Computer Vision (2) · Multimodal Models (1) · Reasoning & Chain-of-Thought (1) · Architecture Design (Transformers, SSMs, MoE) (1) · Inference & Quantization (1)

Frequent co-authors

Hongxu Yin (2) · An-Chieh Cheng (1) · Yang Fu (1) · Yang Fu (1)

Papers (3)

NVIDIA · May 28, 2026 · also Beihang, HKU, UCSD, University of California

Grounded 3D-Aware Spatial Vision-Language Modeling

Grounding boosts spatial reasoning in VLMs: explicitly linking language to 2D and 3D scene elements lets models decompose complex spatial problems and improve performance even on non-grounded tasks.

An-Chieh Cheng, Yang Fu, Yang Fu +21

Computer Vision · Multimodal Models · Reasoning & Chain-of-Thought

NVIDIA · May 26, 2026 · also PI, UPenn

JetViT: Efficient High-Resolution Vision Transformer with Post-Training Attention Search

Get up to 1.79x faster ViT inference on high-resolution images without sacrificing accuracy by surgically replacing full-attention blocks with cheaper alternatives *after* pre-training.

Dongyun Zou, Zhuoyang Zhang, Wenkun He +3

Architecture Design (Transformers, SSMs, MoE) · Computer Vision · Inference & Quantization

Luke Huang +3 · Feb 19, 2026

Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs

Asynchronous RL for LLMs can be sped up 2.5x by explicitly controlling policy-gradient variance, without sacrificing synchronous performance.

Luke Huang, Zhuoyang Zhang, Qinghao Hu +1

Distributed Systems & Hardware · RLHF & Preference Learning · Training Efficiency & Optimization