Search papers, labs, and topics across Lattice.
1
0
3
Get 80% of your oracle feedback for free: ROVED leverages vision-language embeddings to drastically reduce the need for human preferences in reinforcement learning.