Julian J. McAuley

University of California, San Diego

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)

Frequent co-authors

Yu Xia (1)Canwen Xu (1)Zhewei Yao (1)Yuxiong He (1)

Papers (1)

Apr 1, 2026

Apr 1, 2026·also Snowflake AI Research

Learning to Hint for Reinforcement Learning

Stop hand-crafting hints for RL agents: HiLL learns to generate adaptive hints that actually improve the agent's performance on the original task, not just the hinted one.

Yu Xia, Canwen Xu, Zhewei Yao +2

Reasoning & Chain-of-Thought RLHF & Preference Learning

Search

Julian J. McAuley

Research focus

Frequent co-authors

Papers (1)