Search papers, labs, and topics across Lattice.
Tsinghua University ♡ Shanghai AI Laboratory
1
0
2
By harnessing implicit supervision from environment dynamics, EnvRL boosts RL success rates by over 4% on long-horizon tasks, revealing a new frontier in agentic learning.