UESTCMay 5, 2026arXiv:2605.03820

Multimodal Learning on Low-Quality Data with Conformal Predictive Self-Calibration

Xun Jiang, Yufan Gu, Disen Hu, Yuqing Hou, Yazhou Yao, Fumin Shen, H. Shen, Xing Xu

AI Summary

This paper introduces Conformal Predictive Self-Calibration (CPSC), a unified framework to address modality imbalance and noisy corruption in multimodal learning by explicitly modeling predictive uncertainty. CPSC uses conformal prediction to guide representation self-calibration (selective feature fusion) and gradient self-calibration (instance-wise gradient weighting). Experiments on six datasets demonstrate that CPSC outperforms existing state-of-the-art methods under both imbalanced and noisy settings.

Key Contribution

Conformal prediction offers a surprisingly effective way to handle both modality imbalance and noisy corruption in multimodal learning by explicitly modeling predictive uncertainty during training.

Abstract

Multimodal learning often grapples with the challenge of low-quality data, which predominantly manifests as two facets: modality imbalance and noisy corruption. While these issues are often studied in isolation, we argue that they share a common root in the predictive uncertainty towards the reliability of individual modalities and instances during learning. In this paper, we propose a unified framework, termed Conformal Predictive Self-Calibration (CPSC), which leverages conformal prediction to equip the model with the ability to perform self-guided calibration on-the-fly. The core of our proposed CPSC lies in a novel self-calibrating training loop that seamlessly integrates two key modules: (1) Representation Self-Calibration, which decomposes unimodal features into components, and selectively fuses the most robust ones identified by a conformal predictor to enhance feature resilience. (2) Gradient Self-Calibration, which recalibrates the gradient flow during backpropagation based on instance-wise reliability scores, steering the optimization towards more trustworthy directions. Furthermore, we also devise a self-update strategy for the conformal predictor to ensure the entire system co-evolves consistently throughout the training process. Extensive experiments on six benchmark datasets under both imbalanced and noisy settings demonstrate that our CPSC framework consistently outperforms existing state-of-the-art methods. Our code is available at https://github.com/XunCHN/CPSC.

Data Curation & Synthetic Data Multimodal Models Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References59

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Multimodal Learning on Low-Quality Data with Conformal Predictive Self-Calibration

Related Papers