This paper introduces FGAesthetics, a new dataset for fine-grained image aesthetic assessment (IAA) comprising 32,217 images organized into series with pairwise comparison annotations. To address the limitations of existing coarse-grained IAA models, the authors propose FGAesQ, a framework that learns discriminative aesthetic scores from relative ranks through three components: Difference-preserved Tokenization, Comparative Text-assisted Alignment, and Rank-aware Regression. The reported experiments show FGAesQ outperforming compared methods in fine-grained scenarios while remaining competitive in coarse-grained evaluation.
You can now reliably pick the *most* aesthetically pleasing image from a subtly different series, thanks to a new dataset and framework designed specifically for fine-grained image aesthetic assessment.
Image aesthetic assessment (IAA) has extensive applications in content creation, album management, and recommendation systems. Such applications often need to pick out the most aesthetically pleasing image from a series of images with subtle aesthetic variations, a task we refer to as fine-grained IAA. Unfortunately, state-of-the-art IAA models are typically designed for coarse-grained evaluation, where images with notable aesthetic differences are evaluated independently on an absolute scale. These models are inherently limited in discriminating fine-grained aesthetic differences. To address this limitation, we contribute FGAesthetics, a fine-grained IAA database with 32,217 images organized into 10,028 series, which are sourced from diverse categories including Natural, AIGC, and Cropping. Annotations are collected via pairwise comparisons within each series. We also devise Series Refinement and Rank Calibration to ensure the reliability of data and labels. Based on FGAesthetics, we further propose FGAesQ, a novel IAA framework that learns discriminative aesthetic scores from relative ranks through Difference-preserved Tokenization (DiffToken), Comparative Text-assisted Alignment (CTAlign), and Rank-aware Regression (RankReg). FGAesQ enables accurate aesthetic assessment in fine-grained scenarios while still maintaining competitive performance in coarse-grained evaluation. Extensive experiments and comparisons demonstrate the superiority of the proposed method.
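To make the annotation scheme concrete, the sketch below shows one simple way pairwise comparisons within a series could be turned into a relative ranking, and how predicted scores could be checked against those comparisons. It is an illustration only: win counting is a stand-in aggregation rule, not the paper's Rank Calibration procedure, and the function names `rank_series` and `pairwise_accuracy`, the `(winner, loser)` tuple format, and the image IDs are all hypothetical.

```python
from collections import defaultdict


def rank_series(image_ids, comparisons):
    """Order the images of one series by number of pairwise wins, best first.

    comparisons: iterable of (winner_id, loser_id) tuples (assumed format).
    Win counting is an illustrative aggregation rule, not the paper's method.
    """
    wins = defaultdict(int)
    for winner, _loser in comparisons:
        wins[winner] += 1
    # sorted() is stable, so tied images keep their input order.
    return sorted(image_ids, key=lambda image_id: -wins[image_id])


def pairwise_accuracy(scores, comparisons):
    """Fraction of annotated pairs where the higher score goes to the winner."""
    comparisons = list(comparisons)
    if not comparisons:
        return 0.0
    correct = sum(scores[winner] > scores[loser] for winner, loser in comparisons)
    return correct / len(comparisons)


# Hypothetical series of three near-identical shots with three annotations.
series = ["img_a", "img_b", "img_c"]
votes = [("img_c", "img_a"), ("img_c", "img_b"), ("img_b", "img_a")]
ranking = rank_series(series, votes)

# Hypothetical model scores; they agree with every annotated pair.
predicted = {"img_a": 0.41, "img_b": 0.47, "img_c": 0.52}
accuracy = pairwise_accuracy(predicted, votes)
```

The example scores differ only slightly, which is the fine-grained setting the abstract describes: a model is judged on whether it orders near-identical images correctly, not on how closely it matches an absolute score.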