Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Minzheng Wang | Lattice

Minzheng Wang

Institute of Automation, Chinese Academy of Sciences

Papers on Lattice

1

Total citations

0

Topics

3

h-index

8

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Training Efficiency & Optimization (1)

Frequent co-authors

Yuqiao Tan (1)Bo Liu (1)Zichen Liu (1)Tian Liang (1)

Papers (1)

Apr 15, 2026

Apr 15, 2026

From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space

LLMs can be made to reason much better by directly optimizing their pre-training output distribution, even before fine-tuning on specific tasks.

Yuqiao Tan, Minzheng Wang, Bo Liu +2

Reasoning & Chain-of-Thought RLHF & Preference Learning Training Efficiency & Optimization