University of Massachusetts Amherst
Current benchmarks fail to rigorously evaluate deep research agents, but a new framework leveraging structured knowledge bases and synthetic data offers a verifiable and scalable solution.