Search papers, labs, and topics across Lattice.
3
0
3
Visualizing the critic's loss landscape reveals distinct characteristics linked to stable vs. unstable learning in online RL, offering a new window into algorithm dynamics.
Visualizing the loss landscape of off-policy RL critics reveals distinct geometric patterns linked to convergence and divergence, offering a new diagnostic tool for understanding optimization dynamics.
Uncover the hidden dynamics of your RL agent with a new visualization framework that reveals how TD errors sculpt the optimization landscape and drive policy updates.