Yunsheng Lu

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Constitutional AI & AI Ethics (1)RLHF & Preference Learning (1)

Frequent co-authors

Zijiang Yang (1)Licheng Pan (1)Zhixuan Chu (1)

Papers (1)

Apr 15, 2026

Apr 15, 2026·also UChicago

Robust Reward Modeling for Large Language Models via Causal Decomposition

Reward models can be made more robust to spurious cues like length and sycophancy by explicitly training them to understand the *intent* behind a prompt.

Yunsheng Lu, Zijiang Yang, Licheng Pan +1

Constitutional AI & AI Ethics RLHF & Preference Learning

Search

Yunsheng Lu

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)