Xiaohan Wang

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Training Efficiency & Optimization (2)Multimodal Models (2)Eval Frameworks & Benchmarks (2)Interpretability & Mechanistic Interp (2)

Frequent co-authors

Guojun Yin (3)Wei Lin (2)Yaocheng Zhang (1)Yuanheng Zhu (1)

Papers (5)

Apr 15, 2026

Yaocheng Zhang +151w ago·also CAS

$\pi$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data

Self-play can be dramatically improved by exploiting the "question construction path" it generates as privileged information for self-distillation, leading to 2-3x faster learning.

Yaocheng Zhang, Yuanheng Zhu, Yuanheng Zhu +13

Data Curation & Synthetic Data Tool Use & Agents Training Efficiency & Optimization+1

Apr 6, 2026

2w ago·also Meta AI, Stanford HAI, Concordia University, Fudan +1

GLANCE: A Global-Local Coordination Multi-Agent Framework for Music-Grounded Non-Linear Video Editing

Music-grounded video editing can now produce significantly more coherent timelines thanks to a novel global-local coordination mechanism that resolves cross-segment conflicts.

Zihao Lin, Haibo Wang, Zhiyang Xu +7

Computer Vision Multimodal Models Speech & Audio

Mar 15, 2026

He Li +3Mar 15, 2026

Fine-tuning MLLMs Without Forgetting Is Easier Than You Think

MLLMs are surprisingly robust to catastrophic forgetting during fine-tuning, needing only simple regularization or data-hybrid training to maintain performance.

He Li, Xiaohan Wang, Kaifeng Lyu +1

Eval Frameworks & Benchmarks Multimodal Models Training Efficiency & Optimization

Mar 9, 2026

Mar 9, 2026·also Beihang, JKU, Meituan, PKU

CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling

Forget noisy, biased LLM evaluators: CDRRM distills preference insights into compact rubrics, letting a frozen judge model leapfrog fully fine-tuned baselines with just 3k training samples.

Dengcan Liu, Fengkai Yang, Xiaohan Wang +4

Constitutional AI & AI Ethics Interpretability & Mechanistic Interp RLHF & Preference Learning

Mar 3, 2026

Mar 3, 2026·also MIT CSAIL, Meituan

SAE as a Crystal Ball: Interpretable Features Predict Cross-domain Transferability of LLMs without Training

Predict how well your LLM will transfer to a new domain *before* fine-tuning, by using sparse autoencoders to spot tell-tale signs of domain shift in the model's representations.

Xiaohan Wang, Guojun Yin, Yisen Wang

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Natural Language Processing

Search

Xiaohan Wang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (5)