LLMs can maintain long-context performance even with aggressive KV-cache eviction by learning to predict token importance and compressing evicted tokens into a latent memory.
Forget prompt engineering: MOSS lets autonomous agents rewrite their own source code to fix bugs and improve performance in production.