Even the best LLMs violate complex tool-use constraints more than 50% of the time, revealing a critical weakness for real-world agent deployment.