NL2SQL evaluation has a new champion: ROSE, an intent-centered metric that aligns 24% better with human judgment than existing metrics.
Forget slow, bloated LLMs – this work shows you can get GPT-4o quality on long-document QA with a 3B model and a clever structure-first distillation approach.
Forget vector embeddings: DocSage uses SQL-powered indexing and relational tables to achieve 27% higher accuracy on multi-document question answering.
Stop struggling with SQL dialects: Dial offers a knowledge-grounded approach that boosts NL2SQL accuracy by 10% and feature coverage by 15% across diverse database systems.