Boston University
Skip the training and the hyperparameter tuning: Swift Sampling uses Taylor series to find the most informative frames in a video, beating existing methods with a 30x reduction in overhead.
Vision-language models (VLMs), despite excelling at semantic tasks, are surprisingly brittle when faced with basic geometric transformations like rotations and scaling.