Beijing Academy of Artificial Intelligence
Forget external teachers: the best way to boost your RL policy might be to learn from its future self.
RL fine-tuning of discrete diffusion models can be made dramatically more stable and effective by treating the final denoised sample as the action and reconstructing trajectories using the forward diffusion process.
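The trajectory-reconstruction idea can be illustrated with a toy sketch. Assuming an absorbing-state (masked) discrete diffusion model, the final denoised sample `x0` is treated as the action, and a denoising trajectory is recovered by running the *forward* masking process on `x0`; the policy log-probability is then the sum of log-probs of the tokens revealed at each step. All names (`forward_mask_trajectory`, `trajectory_logprob`, the uniform stand-in denoiser) and constants are hypothetical, not from the paper:

```python
import numpy as np

VOCAB, LENGTH, STEPS = 8, 6, 4
MASK = VOCAB  # absorbing mask token id, outside the normal vocabulary

def forward_mask_trajectory(x0, steps, rng):
    """Reconstruct a denoising trajectory from the final sample x0 by
    simulating the forward (masking) diffusion process.  Masking an
    unmasked token with prob 1/(steps - t + 1) at step t gives the usual
    marginal of t/steps masked tokens; masks are absorbing."""
    traj = [x0.copy()]
    x = x0.copy()
    for t in range(1, steps + 1):
        hit = rng.random(LENGTH) < 1.0 / (steps - t + 1)
        x = np.where(hit, MASK, x)
        traj.append(x.copy())
    return traj[::-1]  # reversed: fully masked -> ... -> x0

def trajectory_logprob(traj, probs_fn):
    """Log-prob of the reconstructed trajectory under the denoiser:
    sum the log-probs of tokens revealed at each denoising step.
    Multiplying this by a reward on x0 gives a REINFORCE-style
    surrogate where x0 is the action."""
    lp = 0.0
    for s_prev, s_next in zip(traj[:-1], traj[1:]):
        revealed = (s_prev == MASK) & (s_next != MASK)
        probs = probs_fn(s_prev)          # (LENGTH, VOCAB) per-position dist
        idx = np.where(revealed)[0]
        lp += np.log(probs[idx, s_next[idx]]).sum()
    return lp

# Stand-in denoiser: uniform distribution over the vocabulary.
uniform = lambda s: np.full((LENGTH, VOCAB), 1.0 / VOCAB)

rng = np.random.default_rng(0)
x0 = rng.integers(0, VOCAB, LENGTH)   # the final denoised sample ("action")
traj = forward_mask_trajectory(x0, STEPS, rng)
lp = trajectory_logprob(traj, uniform)
```

Because the forward process is absorbing, the reversed trajectory starts fully masked and only ever reveals tokens, so every position contributes exactly one log-prob term; under the uniform stand-in, `lp` equals `LENGTH * log(1/VOCAB)`.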