University of Amsterdam
Current benchmarks fail to rigorously evaluate deep research agents, but a new framework leveraging structured knowledge bases and synthetic data offers a verifiable and scalable solution.