Hong Kong University of Science and Technology, Southern University of Science and Technology
Standard RL rollouts can effectively provide world modeling supervision, leading to significant performance gains in language agents.