Mar 3, 2026

arXiv:2603.02710

MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration

Lingshun Kong, Jiawei Zhang, Zhengpeng Duan, Xiaohe Wu, Yueqi Yang, Xiaotao Wang, Dongqing Zou, Lei Lei, Jinshan Pan

AI Summary

This paper introduces MiM-DiT, a novel all-in-one image restoration framework leveraging a dual-level Mixture-of-Experts (MoE) architecture integrated with a pre-trained diffusion model. The Inter-MoE layer handles major degradation types by adaptively combining expert groups, while the Intra-MoE layer selects specialized sub-experts for fine-grained variations. Experiments demonstrate state-of-the-art performance across multiple image restoration tasks, showcasing the framework's ability to handle complex, real-world corruptions through coarse-grained adaptation and fine-grained modulation.

Key Contribution

A dual-level Mixture-of-Experts architecture can effectively unify diverse image restoration tasks within a single diffusion transformer model, achieving state-of-the-art results.

Abstract

All-in-one image restoration is challenging because different degradation types, such as haze, blur, noise, and low light, impose diverse requirements on restoration strategies, making it difficult for a single model to handle them effectively. In this paper, we propose a unified image restoration framework that integrates a dual-level Mixture-of-Experts (MoE) architecture with a pretrained diffusion model. The framework operates at two levels: the Inter-MoE layer adaptively combines expert groups to handle major degradation types, while the Intra-MoE layer further selects specialized sub-experts to address fine-grained variations within each type. This design enables the model to achieve coarse-grained adaptation across diverse degradation categories while performing fine-grained modulation for specific intra-class variations, ensuring high specialization in handling complex, real-world corruptions. Extensive experiments demonstrate that the proposed method performs favorably against state-of-the-art approaches on multiple image restoration tasks.
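The two-level routing described in the abstract can be illustrated with a small toy sketch. The abstract gives no implementation details, so everything here is an assumption made for illustration: the class names `InterMoE` and `IntraMoE`, the softmax gate over groups, the top-k selection of sub-experts, the linear experts, and all sizes are hypothetical and should not be read as the authors' method. The sketch operates on a single feature vector and omits the diffusion transformer entirely.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Matrix-vector product: one output value per weight row."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def make_matrix(rows, cols, rng):
    """Random weight matrix standing in for learned parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class IntraMoE:
    """One expert group (hypothetical): sparsely selects top-k sub-experts."""

    def __init__(self, dim, num_sub_experts, top_k, rng):
        self.top_k = top_k
        self.router = make_matrix(num_sub_experts, dim, rng)
        self.sub_experts = [make_matrix(dim, dim, rng) for _ in range(num_sub_experts)]

    def route(self, x):
        """Return (sub-expert index, weight) pairs for the top-k sub-experts."""
        logits = linear(self.router, x)
        order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
        chosen = order[: self.top_k]
        weights = softmax([logits[i] for i in chosen])
        return list(zip(chosen, weights))

    def forward(self, x):
        out = [0.0] * len(x)
        for idx, w in self.route(x):
            y = linear(self.sub_experts[idx], x)
            out = [o + w * yi for o, yi in zip(out, y)]
        return out


class InterMoE:
    """Outer layer (hypothetical): softly combines all expert groups."""

    def __init__(self, dim, num_groups, num_sub_experts, top_k, seed=0):
        rng = random.Random(seed)
        self.gate = make_matrix(num_groups, dim, rng)
        self.groups = [IntraMoE(dim, num_sub_experts, top_k, rng) for _ in range(num_groups)]

    def group_weights(self, x):
        """Dense softmax weights, one per expert group."""
        return softmax(linear(self.gate, x))

    def forward(self, x):
        out = [0.0] * len(x)
        for g_w, group in zip(self.group_weights(x), self.groups):
            y = group.forward(x)
            out = [o + g_w * yi for o, yi in zip(out, y)]
        return out


# Toy usage: 3 groups (e.g. one per major degradation type), 4 sub-experts each.
layer = InterMoE(dim=4, num_groups=3, num_sub_experts=4, top_k=2)
token = [0.2, -0.1, 0.4, 0.05]
restored = layer.forward(token)
```

In this sketch the outer gate is dense, so every group contributes, which mirrors "adaptively combines expert groups". The inner router is sparse, so only `top_k` sub-experts run per group, which mirrors "selects specialized sub-experts". Whether the paper uses dense or sparse gating at either level is not stated in the abstract.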

Architecture Design (Transformers, SSMs, MoE)

Computer Vision Multimodal Models

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
