Search papers, labs, and topics across Lattice.
Hefei University of Technology, Hefei, China
1
0
2
5
Forget hand-crafted benchmarks: this paper shows how LLMs can continuously generate relevant evaluation datasets for enterprise AI agents from just a few semi-structured documents.