Search papers, labs, and topics across Lattice.
This paper introduces triangular consistency as a universal constraint for learning optical flow, applicable across various network architectures, supervision types, and datasets. By composing two optical flows to derive a third, the method enforces consistency in both image-pair and multi-frame contexts, enhancing the robustness of flow estimation. Experimental results demonstrate significant performance improvements in supervised, unsupervised, and transfer learning scenarios, highlighting the method's versatility and effectiveness.
Triangular consistency can universally enhance optical flow learning, yielding consistent performance gains across diverse training settings without additional annotations.
We propose triangular consistency as a first-principled constraint for optical flow, which is agnostic to network architecture, supervision type, and dataset, and applies to both image-pair and multi-frame settings. This simple but powerful constraint is to compose two flows to induce a third flow and enforce consistency among the three. The composed flows may arise from (i) image pairs, yielding cycle consistency; (ii) multiple video frames, producing longer-range motion through temporal chaining; or (iii) image pairs combined with controlled synthetic transformations, which becomes data augmentation. This triangular consistency introduces negligible computational overhead and requires no additional annotations. Since it is derived directly from the geometry of optical flow, it does not rely on model-specific assumptions and serves as a ``universal'' plug-and-play component for optical flow training. Experiments show consistent improvement across supervised, unsupervised, and transfer learning settings.