Search papers, labs, and topics across Lattice.
2
0
2
Forget state-action spaces: this work achieves efficient multi-agent imitation learning by concentrating on feature-level representations in linear Markov games.
Stop hand-tuning reward learning losses: MAVRL learns a shared reward function from diverse feedback signals by treating each as a likelihood within a Bayesian inference framework.