Search papers, labs, and topics across Lattice.
4
0
6
2
Achieve 3x faster video captioning without sacrificing accuracy by swapping quadratic attention for a linear Mamba backbone and hierarchical bidirectional scanning.
A lightweight VLA with deep state space models lets robots outperform larger models at language-guided manipulation while running 3x faster.
Nail design retrieval gets a major upgrade: NaiLIA leverages dense intent descriptions and palette queries to outperform standard methods, opening the door to more nuanced and personalized image search.