Search papers, labs, and topics across Lattice.
Q. Wu, B. Fang, and Antoni B. Chan are with the Department of Computer Science, City University of Hong Kong, China (e-mail: qiangqwu2-c@my.cityu.edu.hk; bofang6-c@my.cityu.edu.hk; abchan@cityu.edu.hk). Q. Wu is also with the Department of Electrical and Computer Engineering, Princeton University, Princeton, NJ 08544 USA.T. Yang is with Meituan, Shenzhen, China. (e-mail: tianyu-yang@outlook.com)J. Wan is with the School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518066, China (e-mail: jiawan1998@gmail.com).Matias Di Martino is with the Department of Electrical and Computer Engineering, Duke University, Durham, NC 27708 USA.Guillermo Sapiro is with the Department of Electrical and Computer Engineering, Princeton University, Princeton, NJ 08544 USA
1
0
3
By cleverly using readily available video segmentation masks, this method boosts DINOv2's point tracking performance by over 14% – a surprisingly effective way to inject temporal awareness into static image-pretrained models.