LLMs can ace the NL2SQL benchmark, but add a few typos or rephrase the question and their performance tanks, especially in agentic settings.