Search papers, labs, and topics across Lattice.
HS Delft
4
0
5
Visualizing the critic's loss landscape reveals distinct characteristics linked to stable vs. unstable learning in online RL, offering a new window into algorithm dynamics.
Visualizing the loss landscape of off-policy RL critics reveals distinct geometric patterns linked to convergence and divergence, offering a new diagnostic tool for understanding optimization dynamics.
Uncover the hidden dynamics of your RL agent with a new visualization framework that reveals how TD errors sculpt the optimization landscape and drive policy updates.
Transfer learning from non-space flare datasets can effectively segment straylight in space camera images, even with limited space-specific training data.