Search papers, labs, and topics across Lattice.
Delft University of Technology Delft, 2628 XE, The Netherlands
2
0
4
By learning to mask attention weights, SMAP enables reinforcement learning agents to generalize far better to unseen environments in Procgen.
Random Network Distillation, a computationally cheap uncertainty method, is theoretically equivalent to both deep ensembles and Bayesian inference under certain conditions, finally giving it a solid theoretical footing.