Shan Yang

Current audio-visual models nail unimodal quality but still struggle to make music and dance move together rhythmically, highlighting a key gap TMD-Bench is designed to address.

Xiaoda Yang, Majun Zhang, Changhao Pan +8

Eval Frameworks & Benchmarks Multimodal Models Speech & Audio

Mar 10, 2026

Mar 10, 2026·also Hunyuan Team, JHU, Tencent AI, UCSC

Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

LLMs can get a 27.8% boost in mathematical reasoning by fusing a hardware-efficient optimal control layer directly into their architecture, enabling planning before prediction.

Peihao Wang, Shan Yang, Shanzhe Yang +9

Distributed Systems & Hardware Reasoning & Chain-of-Thought World Models & Planning

Feb 23, 2026

Hunyuan TeamFeb 23, 2026·also Tencent AI

Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning

Forget scaling bottlenecks: DG-PG uses differentiable analytical models to slash gradient variance in cooperative MARL, achieving convergence in a 200-agent cloud scheduling task where standard methods fail.

Shan Yang

Distributed Systems & Hardware Training Efficiency & Optimization

Search

Shan Yang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)