Search papers, labs, and topics across Lattice.
1
0
3
4
Mamba models have hidden performance just waiting to be unlocked: simply scaling the activations of identified "bottleneck" subspaces boosts performance by over 8% across diverse tasks.