Independent Researcher
Even the top-performing conversational agents struggle with reliability, hitting only 57% accuracy on a new benchmark designed to test agentic recommender systems.