Search papers, labs, and topics across Lattice.
2
0
4
4
Skill0.5 achieves state-of-the-art out-of-distribution generalization in agentic RL by intelligently combining skill internalization and utilization, outperforming methods that rely solely on one or the other.
LLM agents can internalize skills via in-context RL, achieving zero-shot autonomous behavior without the token overhead and retrieval noise of traditional methods.