Search papers, labs, and topics across Lattice.
1
0
3
LLM agents can learn complex, multi-turn tasks far more effectively by explicitly separating planning from execution, using a hierarchical RL approach with carefully designed credit assignment.