Search papers, labs, and topics across Lattice.
1
0
3
Forget slow reranking: this new method compresses documents into embeddings, letting an 8B parameter model run up to 18x faster than smaller models with better accuracy.