Yang Zhou

Research focus

Computer Vision (6) · Multimodal Models (4) · Robotics & Embodied AI (3) · World Models & Planning (3) · Distributed Systems & Hardware (2)

Frequent co-authors

Jing Ma (1) · Jingyu Ma (1) · Yibo Peng (1) · Zhenguo Sun (1)

Papers (10)

Apr 30, 2026

ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control

Forget painstakingly programming robot interactions – ExoActor uses video generation to hallucinate plausible behaviors, then translates them into robot actions.

Yang Zhou, Jing Ma, Jingyu Ma +6

Computer Vision · Robotics & Embodied AI · World Models & Planning

Yang Zhou +7 · Apr 30, 2026

A Real-time Scale-robust Network for Glottis Segmentation in Nasal Transnasal Intubation

Real-time glottis segmentation during nasotracheal intubation just got a whole lot faster and more accurate, thanks to a new network that's both lightweight and scale-robust.

Yang Zhou, Yang Zhou, Chaoyong Zhang +5

Computer Vision · Robotics & Embodied AI

Apr 21, 2026

Huazhong Agricultural University · Apr 21, 2026 · also HUST

DINO Eats CLIP: Adapting Beyond Knowns for Open-set 3D Object Retrieval

DINO, not CLIP, might be the better foundation for open-set 3D object retrieval, especially when paired with dynamic view integration and virtual feature synthesis to avoid overfitting.

Xinwei He, Yansong Zheng, Qianru Han +7

Computer Vision · Multimodal Models · Recommendation & Information Retrieval

Apr 19, 2026

Chon Lam Lao +8 · Apr 19, 2026

CCCL: In-GPU Compression-Coupled Collective Communication

Get up to 10% more throughput on your LLM disaggregation workloads just by swapping in this drop-in collective communications library with built-in compression.

Chon Lam Lao, Zhiying Xu, Zhuang Wang +6

Distributed Systems & Hardware · Inference & Quantization · Training Efficiency & Optimization

Apr 13, 2026

Yang Zhou +2 · Apr 13, 2026

Unfolding 3D Gaussian Splatting via Iterative Gaussian Synopsis

Compressing 3D Gaussian Splatting models by iteratively "unfolding" them from a full-resolution version yields surprisingly compact representations without sacrificing rendering quality.

Yang Zhou, Yihua Dai, Guiqing Li

Computer Vision · Inference & Quantization

Apr 8, 2026

Apr 8, 2026 · also Key Laboratory of Cyberspace Security, NUDT

How Independent are Large Language Models? A Statistical Framework for Auditing Behavioral Entanglement and Reweighting Verifier Ensembles

LLMs are far more alike than you think: shared biases and failure modes mean that ensembling them is less effective than you'd hope.

Chenchen Kuai, Jiwan Jiang, Zihao Zhu +6

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Red-Teaming & Adversarial Robustness

Apr 7, 2026

Rui Tang +4 · Apr 7, 2026 · also Florida Institute of Technology

PanopticQuery: Unified Query-Time Reasoning for 4D Scenes

Answering complex questions about 4D scenes just got a whole lot better: PanopticQuery leverages multi-view semantic consensus to transform noisy, view-dependent predictions into globally consistent 4D interpretations.

Rui Tang, Yang Zhou, Zhong Ye +2

Computer Vision · Multimodal Models · Natural Language Processing

Apr 2, 2026

Yang Zhou +10 · Apr 2, 2026

DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning

Explicitly modeling depth in world-action models significantly boosts planning robustness and future prediction quality for autonomous driving.

Yang Zhou, Xiaofeng Wang, Hao Shao +8

Multimodal Models · Robotics & Embodied AI · World Models & Planning

Apr 2, 2026 · also Cornell, Duke, Scitix, UC Davis

Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models

Diffusion language models can achieve faster convergence and improved accuracy simply by swapping token-choice routing for expert-choice routing, and further benefit from allocating more compute to early denoising steps.

Shuibai Zhang, Caspian Zhuang, Chihan Cui +8

Architecture Design (Transformers, SSMs, MoE) · Distributed Systems & Hardware · Training Efficiency & Optimization

Mar 4, 2026

AI2 · Mar 4, 2026 · also Tsinghua AI, Adobe Research, Dolby Laboratories, Oregon +2

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Finally, AI can generate hour-long videos with consistent characters and backgrounds, thanks to a new framework that nails seamless transitions between shots.

Mohamed Elmoghany, Liangbing Zhao, Xiaoqian Shen +28

Computer Vision · Multimodal Models · World Models & Planning
