Self-summarization in LLMs can enhance reasoning coherence and reduce context exhaustion, leading to a 4% performance boost with shorter rollouts.
Shifting credit assignment to fine-grained decision points boosts agentic RL performance by nearly 4 points, challenging the conventional focus on tool-call boundaries.
Bootstrapping LLM agents to co-evolve as both agent and environment can yield an average improvement of over 4% on complex tasks.