Search papers, labs, and topics across Lattice.
Carnegie Mellon University
1
0
3
Personalization is key: agents struggle with multi-app tasks, achieving only 37% accuracy despite an overall score of 52%.