Zhongwei Wan

The Ohio State University

Papers on Lattice

Total citations

Topics

Publication activity (papers/week, last 8 weeks)

Research focus

Architecture Design (Transformers, SSMs, MoE) (1) · Interpretability & Mechanistic Interp (1) · Natural Language Processing (1) · Eval Frameworks & Benchmarks (1)

Frequent co-authors

Jing Xiong (3), Zunhai Su (1), Hengyuan Zhang (1), Yifan Zhang (1)

Papers (3)

Apr 11, 2026

Tsinghua AI · 2w ago · also HKU, Huawei, LongCat Team, Ohio State +3

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Attention Sink, where Transformers fixate on seemingly irrelevant tokens, is more than a quirk: it is a fundamental challenge that affects training and inference and can even cause hallucinations, so it demands a systematic approach to understanding and mitigating its effects.

Zunhai Su, Hengyuan Zhang, Yifan Zhang +13

Architecture Design (Transformers, SSMs, MoE) · Interpretability & Mechanistic Interp · Natural Language Processing

Mar 16, 2026

also HKU, Independent, Indiana University +3

MMSpec: Benchmarking Speculative Decoding for Vision-Language Models

Text-based speculative decoding falls flat for vision-language models, but ViSkip dynamically adapts to vision tokens for state-of-the-art acceleration.

Yunta Hsieh, Qi Han, Zhongwei Wan +5

Eval Frameworks & Benchmarks · Inference & Quantization · Multimodal Models

Feb 23, 2026

also HKU, HUST, Imperial, Shanghai AI Lab +2

DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning

LLMs can reason better if you force them to explore *different* ways of being right, not just be more random.

Zhongwei Wan, Zhihao Dou +8

Reasoning & Chain-of-Thought · RLHF & Preference Learning
