The paper introduces a self-supervised Geometric Attribute Exploration Network (GAEor) to improve ultra-fine-grained visual categorization (Ultra-FGVC) in data-limited scenarios by exploiting intrinsic geometrical features. GAEor amplifies geometry-relevant details using visual feedback from a backbone network and then embeds the relative polar coordinates of these details into the final representation. Experiments on five Ultra-FGVC benchmarks show that GAEor achieves state-of-the-art results, demonstrating the effectiveness of geometric attributes as recognition cues.
Geometric attributes, such as the intricate vein structures of soybean leaves, enable state-of-the-art ultra-fine-grained visual categorization even with limited data.
This paper investigates the intrinsic geometrical features of highly similar objects and introduces a general self-supervised framework, the Geometric Attribute Exploration Network (GAEor), designed to address the ultra-fine-grained visual categorization (Ultra-FGVC) task in data-limited scenarios. Unlike prior work, which often focuses on capturing subtle yet critical distinctions, GAEor generates geometric attributes as novel alternative recognition cues. These attributes are determined by various details within the object and are aligned with its geometric patterns, such as the intricate vein structures in soybean leaves. Crucially, each category exhibits distinct geometric descriptors that serve as powerful cues even among objects with minimal visual variation, a factor largely overlooked in recent research. GAEor discovers these geometric attributes by first amplifying geometry-relevant details via visual feedback from a backbone network, then embedding the relative polar coordinates of these details into the final representation. Extensive experiments demonstrate that GAEor sets new state-of-the-art results on five widely used Ultra-FGVC benchmarks.
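To make the idea of "relative polar coordinates of details" concrete, here is a minimal sketch. It is not the authors' implementation. It assumes the backbone's visual feedback has already been reduced to a 2D saliency map, picks the most salient locations as stand-ins for geometry-relevant details, and expresses each one as a (radius, angle) pair relative to the strongest location. The function names `top_k_points` and `relative_polar`, the choice of anchor, and the toy saliency values are all hypothetical.

```python
import math

def top_k_points(saliency, k):
    """Return the (row, col) positions of the k largest values in a 2D list."""
    cells = [(v, r, c) for r, row in enumerate(saliency) for c, v in enumerate(row)]
    cells.sort(key=lambda t: t[0], reverse=True)  # highest saliency first
    return [(r, c) for _, r, c in cells[:k]]

def relative_polar(points, anchor):
    """Express each point as (radius, angle in radians) relative to the anchor."""
    anchor_row, anchor_col = anchor
    coords = []
    for row, col in points:
        dy, dx = row - anchor_row, col - anchor_col
        coords.append((math.hypot(dx, dy), math.atan2(dy, dx)))
    return coords

# Toy saliency map standing in for backbone feedback (assumed values).
saliency = [
    [0.9, 0.0, 0.0, 0.7],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.5, 0.0, 0.0, 0.0],
]

points = top_k_points(saliency, 3)
# Assumption: use the most salient detail as the reference point.
polar = relative_polar(points[1:], points[0])
```

Because the coordinates are measured relative to a point on the object rather than to the image frame, the resulting descriptor does not change when the object is translated within the image. How GAEor actually chooses its reference and embeds the coordinates is not specified in the abstract.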