Ditch inefficient critic learning in multi-agent reinforcement learning (MARL): fine-tune a pre-trained vision-language model to evaluate multi-agent behavior and drastically improve sample efficiency.