Search papers, labs, and topics across Lattice.
1
0
3
Linear RNNs achieve transformer-like parallelization because they're essentially log-depth arithmetic circuits, while nonlinear RNNs are fundamentally limited by their ability to solve computationally harder problems.