Zeroth-order fine-tuning can be sped up by over 8x by reframing it as an inference workload and executing it within a serving runtime.
LLMs can now leverage a hierarchical graph structure for memory retrieval, enabling global reasoning and boosting performance on long-term memory benchmarks beyond what's achievable with similarity search alone.