Shanghai Jiao Tong University
PermaVid achieves unprecedented long-term consistency in video generation, even after significant edits, by disentangling appearance and geometry in its memory architecture.
Semantic visual-action tokenization in RepWAM significantly enhances robotic manipulation performance, outperforming traditional reconstruction-based approaches.