Search papers, labs, and topics across Lattice.
1
0
3
By weighting model rollouts based on predicted confidence, WIMLE achieves state-of-the-art sample efficiency in model-based RL, outperforming existing methods on complex continuous control tasks.