The paper identifies a key pathology in supervised low-level vision training: photometric inconsistencies between image pairs skew gradient allocation toward conflicting photometric targets, hindering content restoration. The authors prove that, under least-squares decomposition, the photometric and structural components of the prediction-target residual are orthogonal, and that the photometric component dominates the gradient energy. To address this, they introduce Photometric Alignment Loss (PAL), which discounts photometric discrepancies via closed-form affine color alignment.
Seemingly innocuous photometric variations between training pairs cripple low-level vision models, but a simple closed-form alignment loss can unlock significant gains across diverse tasks and architectures.
Supervised low-level vision models rely on pixel-wise losses against paired references, yet paired training sets exhibit per-pair photometric inconsistency: different image pairs demand different global brightness, color, or white-balance mappings. This inconsistency enters through task-intrinsic photometric transfer (e.g., low-light enhancement) or unintended acquisition shifts (e.g., de-raining), and in either case it causes an optimization pathology. Standard reconstruction losses allocate a disproportionate share of the gradient budget to conflicting per-pair photometric targets, crowding out content restoration. In this paper, we investigate this issue and prove that, under least-squares decomposition, the photometric and structural components of the prediction-target residual are orthogonal, and that the spatially dense photometric component dominates the gradient energy. Motivated by this analysis, we propose Photometric Alignment Loss (PAL). This flexible supervision objective discounts nuisance photometric discrepancy via closed-form affine color alignment while preserving restoration-relevant supervision. It requires only covariance statistics and a small matrix inversion, so its overhead is negligible. Across 6 tasks, 16 datasets, and 16 architectures, PAL consistently improves metrics and generalization. The implementation is in the appendix.
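To make the idea of closed-form affine color alignment concrete, the following is a minimal NumPy sketch, not the authors' implementation (which the abstract places in the appendix). It assumes images are float arrays of shape H x W x C, that a single global affine color map per image pair is fitted by least squares from the prediction to the target, and that the loss is the mean squared error remaining after that alignment. The function names `affine_color_align` and `photometric_alignment_loss` and the ridge parameter `eps` are hypothetical and chosen only for illustration.

```python
import numpy as np


def affine_color_align(pred, target, eps=1e-6):
    """Fit target ~ pred @ M + b over all pixels (least squares) and
    return the affinely re-colored prediction.

    Assumption: one global C x C matrix M and offset b per image pair.
    `eps` is a small ridge term (an illustrative choice) that keeps the
    C x C covariance solvable for nearly constant images.
    """
    c = pred.shape[-1]
    x = pred.reshape(-1, c)    # (N, C) predicted colors
    y = target.reshape(-1, c)  # (N, C) reference colors
    mu_x, mu_y = x.mean(axis=0), y.mean(axis=0)
    xc, yc = x - mu_x, y - mu_y
    n = x.shape[0]

    # Only covariance statistics are needed: C x C matrices.
    cov_xx = xc.T @ xc / n + eps * np.eye(c)
    cov_xy = xc.T @ yc / n

    # Closed-form solution of the normal equations cov_xx @ M = cov_xy.
    m = np.linalg.solve(cov_xx, cov_xy)

    # The optimal offset b = mu_y - mu_x @ M is folded into the centering.
    aligned = xc @ m + mu_y
    return aligned.reshape(pred.shape)


def photometric_alignment_loss(pred, target, eps=1e-6):
    """Mean squared error after discounting the best global affine color map."""
    aligned = affine_color_align(pred, target, eps)
    return float(np.mean((aligned - target) ** 2))
```

Under these assumptions, a target that differs from the prediction only by a global color transform yields a loss near zero, so the remaining penalty reflects content differences. With `eps=0`, the residual left after alignment is orthogonal to the color correction `aligned - pred`, which is one way to read the orthogonality statement in the abstract. In actual training the same computation would be written in an autodiff framework; whether gradients should flow through the fitted coefficients is not specified in the abstract and is left open here.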