Southern University of Science and Technology
LLMs can be brought into closer alignment with human preferences simply by truncating training responses to equal lengths, forcing the model to focus on the crucial prefix tokens that standard Direct Alignment Algorithms (DAAs) often underweight.
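The truncation step described above can be sketched as a small preprocessing helper. This is a minimal illustration, not the paper's implementation: the function name `truncate_pair` and the optional `max_len` cap are assumptions, and token IDs are represented as plain Python lists for simplicity.

```python
def truncate_pair(chosen_ids, rejected_ids, max_len=None):
    """Truncate both responses in a preference pair to the same length.

    Cutting the chosen and rejected responses to a shared length means
    the alignment loss (e.g. DPO) is computed only over the prefix
    tokens both responses cover, rather than being dominated by the
    longer response's tail.
    """
    # Shared length is the shorter of the two responses.
    n = min(len(chosen_ids), len(rejected_ids))
    # Optionally cap at a fixed budget (hypothetical knob).
    if max_len is not None:
        n = min(n, max_len)
    return chosen_ids[:n], rejected_ids[:n]


# Example: a chosen response of 5 tokens vs. a rejected one of 3.
chosen, rejected = truncate_pair([1, 2, 3, 4, 5], [6, 7, 8])
```

After this step, both sequences have length 3, so a pairwise loss over them weights every position equally.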