Search papers, labs, and topics across Lattice.
1
0
2
MADDPG-K scales multi-agent learning by ditching the all-seeing critic for a neighborhood watch, achieving faster training and better performance without the quadratic cost of full observation.