Search papers, labs, and topics across Lattice.
1
0
3
RL agents can learn far more efficiently by incorporating group-level natural language feedback, achieving 2.2x sample efficiency gains in sparse-reward environments.