Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Jiarui Yao | Lattice

Jiarui Yao

Papers on Lattice

1

Total citations

0

Topics

1

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (1)

Frequent co-authors

Xiangxin Zhou (1)Penghui Qi (1)Wee Sun Lee (1)Liefeng Bo (1)

Papers (1)

Jun 8, 2026

Jiarui Yao +53d ago

Rethinking the Divergence Regularization in LLM RL

Smooth gradient adjustments in DRPO prevent harmful policy shifts, leading to more stable and efficient LLM training.

Jiarui Yao, Xiangxin Zhou, Penghui Qi +3

RLHF & Preference Learning

Tianyu Pang (1)