Ditch slow diffusion policies: FMER trains 7x faster and performs better in sparse-reward RL by combining flow matching with a tractable entropy regularization term.