Search papers, labs, and topics across Lattice.
Carnegie Mellon University, University of Maryland
1
0
3
LLMs can leverage "sleep" to distill long contexts into fast weights, unlocking superior reasoning without sacrificing inference latency.