Search papers, labs, and topics across Lattice.
4
0
6
4
SIRI allows LLM agents to autonomously develop and internalize skills, achieving up to a 2.2% performance boost without external dependencies.
MLLMs excel at single-hop tasks but falter dramatically in open-world scenarios, revealing critical gaps in their reasoning capabilities.
LLM agents can internalize skills via in-context RL, achieving zero-shot autonomous behavior without the token overhead and retrieval noise of traditional methods.
Forget hand-tuning rollout budgets: $V_{0.5}$ dynamically allocates compute to sparse RL rollouts based on a real-time statistical test of a generalist value model's prior, slashing variance and boosting performance.