Shuohuan Wang

Research focus

Computer Vision (2); Multimodal Models (2); Speech & Audio (1); Architecture Design (Transformers, SSMs, MoE) (1); Distributed Systems & Hardware (1)

Frequent co-authors

Yu Sun (2), Longbin Ji (1), Guan Wang (1), Guanao Wang (1)

Papers (3)

May 28, 2026

Native Audio-Visual Alignment for Generation

Achieve superior audio-visual generation with a 6.3B-parameter model by disentangling alignment from generation, outperforming larger models.

Longbin Ji, Guan Wang, Guanao Wang +5

Computer Vision; Multimodal Models; Speech & Audio

Apr 6, 2026

CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models

Multimodal models can better understand degraded images by dropping pixel-perfect reconstruction and directly optimizing the latent space for reasoning, leading to higher perceptual quality.

Zefeng Zhang, Zhenyu Zhang, Linhao Yu +4

Computer Vision; Multimodal Models

Mar 5, 2026 · also Baidu

Mixture of Universal Experts: Scaling Virtual Width via Depth-Width Transformation

Beyond scaling depth and width, MOUE unlocks a new "virtual width" dimension for Mixture-of-Experts by reusing a single expert pool across layers.

Naibin Gu, Junyuan Shang, Zhenyu Zhang +7

Architecture Design (Transformers, SSMs, MoE); Distributed Systems & Hardware; Training Efficiency & Optimization
