Jiejing Shao

Papers on Lattice

Total citations

Topics

h-index

Research focus

Reasoning & Chain-of-Thought (1)RLHF & Preference Learning (1)Tool Use & Agents (1)

Frequent co-authors

Huihan Tan (1)Xiao-Wen Yang (1)Hao Chen (1)Yi Wen (1)

Papers (1)

Mar 7, 2026

Huihan Tan +9Mar 7, 2026

Hindsight Credit Assignment for Long-Horizon LLM Agents

LLM agents can learn to solve complex, long-horizon tasks much more effectively by using themselves as post-hoc critics to refine Q-values through hindsight reasoning.

Huihan Tan, Xiao-Wen Yang, Hao Chen +7

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Jiejing Shao

Research focus

Frequent co-authors

Papers (1)