Search papers, labs, and topics across Lattice.
East China Normal University
1
0
3
4
Skill0.5 achieves state-of-the-art out-of-distribution generalization in agentic RL by intelligently combining skill internalization and utilization, outperforming methods that rely solely on one or the other.