MiniMax, Huazhong University of Science and Technology
MSA reduces per-token attention compute by more than 28x while maintaining competitive performance, making ultra-long contexts more practical for LLMs.