Carnegie Mellon University
Forget context window limits: this RL method uses LLM-generated summaries to train agents for long-horizon tasks, achieving higher success rates with less context.