Search papers, labs, and topics across Lattice.
UT Austin, The University of Texas at Austin
1
0
2
4
Forget hand-crafted benchmarks: this paper shows how LLMs can continuously generate relevant evaluation datasets for enterprise AI agents from just a few semi-structured documents.