Search papers, labs, and topics across Lattice.
1
0
2
Contamination in NL2SQL benchmarks can inflate LLM accuracy, but SPENCE reveals that newer datasets are largely free from this bias.