Search papers, labs, and topics across Lattice.
This paper introduces ColorFLUX, a generative diffusion model based on FLUX, designed for accurate colorization of old photos by decoupling structure preservation from color restoration. To address the domain gap between old and modern photos, the model incorporates a progressive Direct Preference Optimization (Pro-DPO) strategy for learning subtle color preferences and uses visual semantic prompts to extract fine-grained semantic information, mitigating color bias. Experiments on synthetic and real datasets demonstrate that ColorFLUX outperforms existing state-of-the-art methods, including commercial models, in producing high-quality and vivid colorizations.
ColorFLUX achieves superior old photo colorization by cleverly disentangling structure and color, outperforming even closed-source commercial models.
Old photos preserve invaluable historical memories, making their restoration and colorization highly desirable. While existing restoration models can address some degradation issues like denoising and scratch removal, they often struggle with accurate colorization. This limitation arises from the unique degradation inherent in old photos, such as faded brightness and altered color hues, which are different from modern photo distributions, creating a substantial domain gap during colorization. In this paper, we propose a novel old photo colorization framework based on the generative diffusion model FLUX. Our approach introduces a structure-color decoupling strategy that separates structure preservation from color restoration, enabling accurate colorization of old photos while maintaining structural consistency. We further enhance the model with a progressive Direct Preference Optimization (Pro-DPO) strategy, which allows the model to learn subtle color preferences through coarse-to-fine transitions in color augmentation. Additionally, we address the limitations of text-based prompts by introducing visual semantic prompts, which extract fine-grained semantic information directly from old photos, helping to eliminate the color bias inherent in old photos. Experimental results on both synthetic and real datasets demonstrate that our approach outperforms existing state-of-the-art colorization methods, including closed-source commercial models, producing high-quality and vivid colorization.