A mere 0.01% of tokens can destabilize LLM reinforcement learning, but masking their gradient updates unlocks significant performance gains.
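The idea can be sketched as a per-token mask applied inside a policy-gradient loss. Everything below is an illustrative assumption, not the paper's actual method: here "destabilizing" tokens are flagged by an extreme importance ratio (a hypothetical proxy), and the mask simply zeroes their contribution so they never produce a gradient update.

```python
import numpy as np

def masked_pg_loss(logprobs, old_logprobs, advantages, ratio_clip=5.0):
    """Policy-gradient loss with destabilizing tokens masked out.

    Assumption for illustration: a token is 'destabilizing' when its
    importance ratio exp(logprob - old_logprob) is extreme. The paper's
    actual criterion may differ; this only shows the masking mechanics.
    """
    ratios = np.exp(logprobs - old_logprobs)
    # Flag the rare tokens with extreme ratios; masking them means their
    # gradients never update the policy.
    mask = (ratios < ratio_clip) & (ratios > 1.0 / ratio_clip)
    per_token = -ratios * advantages
    # Average the loss only over the unmasked (stable) tokens.
    return (per_token * mask).sum() / max(mask.sum(), 1)

# Two ordinary tokens (ratio = 1) and one outlier (ratio ≈ e^9):
lp = np.array([-1.0, -1.0, -1.0])
old = np.array([-1.0, -1.0, -10.0])
adv = np.ones(3)
print(masked_pg_loss(lp, old, adv))  # → -1.0 (outlier contributes nothing)
```

Without the mask, the single outlier token would dominate the loss by a factor of thousands; zeroing its contribution is what keeps the update stable.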