By rethinking RLHF, MicroCoder-GRPO enables smaller code-generation models to rival larger counterparts, delivering significant performance gains and surfacing 34 training insights.
Forget massive datasets: targeted training on a smaller, carefully curated set of challenging competitive-programming problems yields 3x faster gains in code-generation performance.
A 1-bit LLM can match the performance of full-precision models, promising huge gains in efficiency.