Linear attention models can mimic the in-context learning of quadratic attention on simple tasks such as linear regression, but with limitations that this paper elucidates.