Search papers, labs, and topics across Lattice.
1
0
3
Achieve 3x faster video captioning without sacrificing accuracy by swapping quadratic attention for a linear Mamba backbone and hierarchical bidirectional scanning.