Shaohan Huang

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Architecture Design (Transformers, SSMs, MoE) (3)Inference & Quantization (3)Training Efficiency & Optimization (2)Natural Language Processing (1)

Frequent co-authors

Li Dong (3)Furu Wei (3)Li Dong (2)Tianzhu Ye (2)

Papers (4)

Apr 1, 2026

Yutao Sun +53w ago

Universal YOCO for Efficient Depth Scaling

By cleverly combining YOCO's efficient attention with recursive computation, YOCO-U achieves a capability-efficiency sweet spot that neither technique can reach on its own.

Yutao Sun, Li Dong, Tianzhu Ye +3

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Mar 17, 2026

Tianzhu Ye +8Mar 17, 2026·also Qinzheng Sun1

Online Experiential Learning for Language Models

Language models can learn directly from real-world user interactions, boosting performance without human annotations or simulated environments.

Tianzhu Ye, Li Dong, Li Dong +6

Natural Language Processing RLHF & Preference Learning Tool Use & Agents

Mar 5, 2026

Microsoft ResearchMar 5, 2026·also BIT, PKU, Qinzheng Sun1

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

1.58-bit LLMs are surprisingly more resilient to sparsity than their full-precision counterparts, opening new avenues for extreme compression.

Di Zhang, Xun Wu, Shaohan Huang +9

Architecture Design (Transformers, SSMs, MoE)Inference & Quantization Training Efficiency & Optimization

Microsoft ResearchMar 5, 2026·also BIT, Qinzheng Sun1

SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

Unlock 33% faster LLM inference on commodity GPUs with SlideSparse, which finally brings hardware-accelerated (2N-2):2N sparsity to the masses, bridging the accuracy gap left by NVIDIA's strict 2:4 pruning.

Hanyong Shao, Yingbo Hao, Ting Song +9

Architecture Design (Transformers, SSMs, MoE)Distributed Systems & Hardware Inference & Quantization

Search

Shaohan Huang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)