Stop wasting precious GPU memory: this new cache-semantic hash table library achieves up to 3.9 billion key-value lookups per second, outperforming standard approaches by up to 9.4x.
Forget closed-source embedding models: llama-embed-nemotron-8b just topped the MMTEB leaderboard with fully open weights and a data recipe you can actually reproduce.