Tencent, Tsinghua University
FlashMemory-DeepSeek-V4 slashes GPU memory usage by over 90% for ultra-long contexts while enhancing model accuracy.