Search papers, labs, and topics across Lattice.
1
0
3
Slash RAG latency by an order of magnitude using a tiny, LoRA-adapted SLM that routes queries, achieving GPT-4o-mini level accuracy at a fraction of the cost.