Hangzhou High-Tech Zone (BinjiangInstitute of Blockchain and DataZJUMar 17, 2026arXiv:2603.16747

Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

Chenggong Hu, Mengqi Xue, Haofei Zhang, Jie Song

AI Summary

The paper introduces SLDDM-TPG, a two-stage approach for textile pattern generation (TPG) that addresses feature confusion between textile patterns and clothing distortions. First, a latent disentangled network (LDN) creates a multi-dimensional clothing feature space. Second, a semi-supervised latent diffusion model (S-LDM) uses LDN guidance and a fine-grained alignment strategy to generate high-fidelity textile patterns. Experiments on CTP-HD and VITON-HD datasets show SLDDM-TPG significantly improves FID and SSIM scores compared to existing image-to-image models.

Key Contribution

Achieve faithful textile pattern generation by disentangling clothing features and guiding a diffusion model with fine-grained alignment, outperforming existing image-to-image methods.

Abstract

Textile pattern generation (TPG) aims to synthesize fine-grained textile pattern images based on given clothing images. Although previous studies have not explicitly investigated TPG, existing image-to-image models appear to be natural candidates for this task. However, when applied directly, these methods often produce unfaithful results, failing to preserve fine-grained details due to feature confusion between complex textile patterns and the inherent non-rigid texture distortions in clothing images. In this paper, we propose a novel method, SLDDM-TPG, for faithful and high-fidelity TPG. Our method consists of two stages: (1) a latent disentangled network (LDN) that resolves feature confusion in clothing representations and constructs a multi-dimensional, independent clothing feature space; and (2) a semi-supervised latent diffusion model (S-LDM), which receives guidance signals from LDN and generates faithful results through semi-supervised diffusion training, combined with our designed fine-grained alignment strategy. Extensive evaluations show that SLDDM-TPG reduces FID by 4.1 and improves SSIM by up to 0.116 on our CTP-HD dataset, and also demonstrate good generalization on the VITON-HD dataset.

Architecture Design (Transformers, SSMs, MoE)Computer Vision Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

Related Papers