Search papers, labs, and topics across Lattice.
1
0
3
5
Real-time video generation just got a whole lot faster: Monarch-RT achieves up to 95% attention sparsity without quality loss and outperforms FlashAttention, finally enabling 16 FPS video generation on a single GPU.