D visual features · Tongji · WHU · Feb 24, 2026 · arXiv:2602.20630

From Pairs to Sequences: Track-Aware Policy Gradients for Keypoint Detection

Yepeng Liu, Liwen Yang, Fangzhen Li, Xudi Ge, Yuliang Gu, kuang Gao, Guang Chen, Hangjun Ye, Yongchao Xu

AI Summary

The paper introduces TraqPoint, a reinforcement learning framework that optimizes keypoint detection for long-term trackability in image sequences, addressing the limitations of existing pair-based training methods. TraqPoint employs a track-aware reward mechanism that promotes both consistency and distinctiveness of keypoints across multiple views. Experiments on sparse matching benchmarks demonstrate that TraqPoint outperforms state-of-the-art keypoint detection and description methods in relative pose estimation and 3D reconstruction.

Key Contribution

Keypoint detectors can be trained with RL to directly optimize long-term trackability across image sequences, which the paper reports improves downstream relative pose estimation and 3D reconstruction.

Abstract

Keypoint-based matching is a fundamental component of modern 3D vision systems, such as Structure-from-Motion (SfM) and SLAM. Most existing learning-based methods are trained on image pairs, a paradigm that fails to explicitly optimize for the long-term trackability of keypoints across sequences under challenging viewpoint and illumination changes. In this paper, we reframe keypoint detection as a sequential decision-making problem. We introduce TraqPoint, a novel, end-to-end Reinforcement Learning (RL) framework designed to optimize the Track-quality (Traq) of keypoints directly on image sequences. Our core innovation is a track-aware reward mechanism that jointly encourages the consistency and distinctiveness of keypoints across multiple views, guided by a policy gradient method. Extensive evaluations on sparse matching benchmarks, including relative pose estimation and 3D reconstruction, demonstrate that TraqPoint significantly outperforms some state-of-the-art (SOTA) keypoint detection and description methods.
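To make the idea of a track-aware policy gradient concrete, the following is a minimal toy sketch, not the paper's implementation. It assumes each candidate keypoint is kept or dropped by an independent Bernoulli policy, and that the reward is the product of a consistency term (fraction of frames in which the keypoint was tracked) and a distinctiveness score in [0, 1], minus a fixed selection cost. The names `track_reward`, `reinforce_step`, `train`, and the `cost` parameter are hypothetical and chosen only for illustration; the abstract does not specify the actual reward formula or policy parameterization.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def track_reward(track_len, num_frames, distinctiveness, cost=0.3):
    """Hypothetical track-aware reward (an assumption, not the paper's formula).

    consistency: fraction of the sequence in which the keypoint was tracked.
    distinctiveness: assumed score in [0, 1], e.g. a descriptor matching margin.
    cost: assumed per-keypoint selection cost, so short tracks earn a negative reward.
    """
    consistency = track_len / num_frames
    return consistency * distinctiveness - cost


def reinforce_step(logits, rewards, lr, rng):
    """One REINFORCE update for independent Bernoulli keep/drop actions."""
    probs = [sigmoid(z) for z in logits]
    actions = [1 if rng.random() < p else 0 for p in probs]
    updated = []
    for z, p, a, r in zip(logits, probs, actions, rewards):
        ret = a * r  # a dropped keypoint collects no reward
        # For a Bernoulli policy, d log pi(a) / d logit = a - p.
        updated.append(z + lr * ret * (a - p))
    return updated


def train(rewards, steps=500, lr=0.5, seed=0):
    """Run repeated REINFORCE updates starting from uniform keep probability."""
    rng = random.Random(seed)
    logits = [0.0] * len(rewards)
    for _ in range(steps):
        logits = reinforce_step(logits, rewards, lr, rng)
    return logits


# Candidate 0: tracked in 8 of 8 frames and distinctive.
# Candidate 1: tracked in 2 of 8 frames and ambiguous.
rewards = [track_reward(8, 8, 0.9), track_reward(2, 8, 0.4)]
logits = train(rewards)
print([round(sigmoid(z), 3) for z in logits])
```

In this toy setting the keep probability rises for the long, distinctive track and falls for the short, ambiguous one, which is the qualitative behavior a track-aware reward is meant to induce. A real detector would replace the per-candidate logits with a network output over image locations and compute tracks by matching across the sequence.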

Computer Vision · RLHF & Preference Learning · Robotics & Embodied AI

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
