Bar-IlanNTT Communication Science LaboratoriesFeb 17, 2026arXiv:2602.15405

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Gilad Nurko, Roi Benita, Yehoshua Dissen, Tomohiro Nakatani, Marc Delcroix, Shoko Araki, Joseph Keshet

AI Summary

This paper introduces a novel framework for robust classification in noisy environments by jointly modeling signal enhancement and classification using coupled diffusion models operating on the input signal and classifier logits. The approach allows for mutual guidance between the enhancing signal and the evolving class logits, improving classification accuracy without requiring classifier retraining. Experiments on image classification and automatic speech recognition demonstrate that the proposed framework outperforms traditional sequential enhancement baselines under various noise conditions.

Key Contribution

Denoise and classify in one fell swoop: coupled diffusion models on signals and logits boost accuracy without retraining your classifier.

Abstract

Robust classification in noisy environments remains a fundamental challenge in machine learning. Standard approaches typically treat signal enhancement and classification as separate, sequential stages: first enhancing the signal and then applying a classifier. This approach fails to leverage the semantic information in the classifier's output during denoising. In this work, we propose a general, domain-agnostic framework that integrates two interacting diffusion models: one operating on the input signal and the other on the classifier's output logits, without requiring any retraining or fine-tuning of the classifier. This coupled formulation enables mutual guidance, where the enhancing signal refines the class estimation and, conversely, the evolving class logits guide the signal reconstruction towards discriminative regions of the manifold. We introduce three strategies to effectively model the joint distribution of the input and the logit. We evaluated our joint enhancement method for image classification and automatic speech recognition. The proposed framework surpasses traditional sequential enhancement baselines, delivering robust and flexible improvements in classification accuracy under diverse noise conditions.

Computer Vision Speech & Audio

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Joint Enhancement and Classification using Coupled Diffusion Models of Signals and Logits

Related Papers