Kenton Tang

University of Edinburgh

Papers on Lattice

Total citations

Topics

Research focus

RLHF & Preference Learning (1)

Frequent co-authors

Yuzhu Chen (1)Fengxiang He (1)

Papers (1)

Feb 25, 2026

Feb 25, 2026·also USTC

Generalisation of RLHF under Reward Shift and Clipped KL Regularisation

RLHF's generalization gap can be decomposed into distinct error terms arising from reward shift and KL clipping, offering a more nuanced understanding of its limitations.

Kenton Tang, Yuzhu Chen, Fengxiang He

RLHF & Preference Learning

Search

Kenton Tang

Research focus

Frequent co-authors

Papers (1)