Search papers, labs, and topics across Lattice.
2
0
5
FlashMemory-DeepSeek-V4 slashes GPU memory usage by over 90% for ultra-long contexts while enhancing model accuracy.
Forget reward engineering: this work shows LLMs can self-evolve and outperform larger models by learning to explore and summarize new environments autonomously.