Search papers, labs, and topics across Lattice.
Independent Research
1
0
2
Forget external rewards—this agent learns to explore and adapt by prioritizing its own ignorance, surprise, and staleness, outperforming fixed strategies.