LLM agents can learn to elicit crucial information from users by rewarding interaction turns that most reduce uncertainty about the optimal action, leading to better task performance.
Forget scraping messy real-world websites: AutoWebWorld lets you synthesize infinite, perfectly verifiable web interaction data for just $0.04 a pop, dramatically boosting agent performance.