Search papers, labs, and topics across Lattice.
1
0
2
A simple value normalization technique unlocks the potential of offline multi-agent RL by stabilizing non-linear value decomposition, a notoriously unstable component.