University of Pisa
Compact, gradient-free MARS models can now outperform state-of-the-art gradient-based sequence models like Mamba, while slashing training times from hours to milliseconds.