Mar 17, 2026arXiv:2603.16649

Mixture of Style Experts for Diverse Image Stylization

Shihao Zhu, Ziheng Ouyang, Zi-Juan Ouyang, Yi Kang, Yijia Kang, Qilong Wang, Mi Zhou, Bo Li, Ming-Ming Cheng, Qibin Hou

AI Summary

The paper introduces StyleExpert, a diffusion-based image stylization framework that uses a Mixture of Experts (MoE) architecture to handle diverse styles spanning multiple semantic levels. A unified style encoder maps styles into a consistent latent space, which then conditions a similarity-aware gating mechanism to route styles to specialized experts. Experiments demonstrate that StyleExpert outperforms existing methods in preserving semantics, material details, and generalization to unseen styles by leveraging this MoE architecture.

Key Contribution

Diffusion models can now capture nuanced semantic and material details in image stylization, moving beyond simple color-driven transformations, thanks to a Mixture of Experts architecture.

Abstract

Diffusion-based stylization has advanced significantly, yet existing methods are limited to color-driven transformations, neglecting complex semantics and material details.We introduce StyleExpert, a semantic-aware framework based on the Mixture of Experts (MoE). Our framework employs a unified style encoder, trained on our large-scale dataset of content-style-stylized triplets, to embed diverse styles into a consistent latent space. This embedding is then used to condition a similarity-aware gating mechanism, which dynamically routes styles to specialized experts within the MoE architecture. Leveraging this MoE architecture, our method adeptly handles diverse styles spanning multiple semantic levels, from shallow textures to deep semantics. Extensive experiments show that StyleExpert outperforms existing approaches in preserving semantics and material details, while generalizing to unseen styles. Our code and collected images are available at the project page: https://hh-lg.github.io/StyleExpert-Page/.

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Citation Metrics

Citations0

Influential citations0

References65

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Mixture of Style Experts for Diverse Image Stylization

Related Papers