Search papers, labs, and topics across Lattice.
Institute for Interdisciplinary Information Sciences, Tsinghua University
1
0
3
LLM agents get stuck in error feedback loops, but ProCeedRL's process-level critic and reflection-based demonstrations can actively break these cycles and substantially improve exploration.