Chonghuan Wang

MIT. Email: ruiai

MIT CSAIL

Papers on Lattice

Total citations

Topics

Research focus

Recommendation & Information Retrieval (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Rui Ai (1)David Simchi-Levi (1)

Papers (1)

Mar 31, 2026

MIT CSAILMar 31, 2026·also Caltech, Department of Civil and Environmental, Department of Computing and Mathematical, Georgia Tech +7

ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training

Stop rewarding all LLM-generated candidates equally: ShapE-GRPO uses Shapley values to fairly distribute credit within sets, leading to better training and faster convergence.

Rui Ai, David Simchi-Levi, Chonghuan Wang

Recommendation & Information Retrieval RLHF & Preference Learning Tool Use & Agents

Search

Chonghuan Wang

Research focus

Frequent co-authors

Papers (1)