Search papers, labs, and topics across Lattice.
Genomics Princeton University Princeton
1
0
3
Exact attention over billion-token sequences is now possible on a single GPU, thanks to a novel streaming approach that avoids out-of-memory errors without approximation.