Search papers, labs, and topics across Lattice.
2
0
4
Hybrid TD3 stabilizes reinforcement learning in hybrid action spaces by taming overestimation bias with a theoretically grounded weighted clipped Q-learning target.
Robots can now learn manipulation skills from unstructured videos with significantly improved accuracy and generalization by decoupling video understanding from policy learning.