Search papers, labs, and topics across Lattice.
2
0
5
A simple value normalization technique unlocks the potential of offline multi-agent RL by stabilizing non-linear value decomposition, a notoriously unstable component.
Factored world models can disentangle the dynamics of multiple interacting entities, leading to more controllable video generation and improved policy learning.