Beijing Institute of Technology
PathRouter reduces reliance on shortcuts in reinforcement learning, leading to more reliable and contextually rich decision-making in language-model agents.