Mar 31, 2026arXiv:2603.29520

TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification

AI Summary

The paper introduces TrafficMoE, a novel deep learning framework for encrypted traffic classification that addresses limitations of static modeling by disentangling headers and payloads using a Disentangle-Filter-Aggregate (DFA) paradigm. TrafficMoE employs dual-branch sparse Mixture-of-Experts (MoE) for modality-specific modeling, an uncertainty-aware filtering mechanism to suppress stochastic noise, and a routing-guided strategy for dynamic cross-modality feature aggregation. Experiments on six datasets demonstrate that TrafficMoE consistently outperforms state-of-the-art methods, highlighting the importance of heterogeneity-aware modeling in encrypted traffic analysis.

Key Contribution

By disentangling headers and payloads with a Mixture-of-Experts architecture, TrafficMoE achieves state-of-the-art encrypted traffic classification, proving that heterogeneity-aware modeling is crucial for extracting discriminative features from noisy, encrypted data.

Abstract

Encrypted traffic classification is a critical task for network security. While deep learning has advanced this field, the occlusion of payload semantics by encryption severely challenges standard modeling approaches. Most existing frameworks rely on static and homogeneous pipelines that apply uniform parameter sharing and static fusion strategies across all inputs. This one-size-fits-all static design is inherently flawed: by forcing structured headers and randomized payloads into a unified processing pipeline, it inevitably entangles the raw protocol signals with stochastic encryption noise, thereby degrading the fine-grained discriminative features. In this paper, we propose TrafficMoE, a framework that breaks through the bottleneck of static modeling by establishing a Disentangle-Filter-Aggregate (DFA) paradigm. Specifically, to resolve the structural between-components conflict, the architecture disentangles headers and payloads using dual-branch sparse Mixture-of-Experts (MoE), enabling modality-specific modeling. To mitigate the impact of stochastic noise, an uncertainty-aware filtering mechanism is introduced to quantify reliability and selectively suppress high-variance representations. Finally, to overcome the limitations of static fusion, a routing-guided strategy aggregates cross-modality features dynamically, that adaptively weighs contributions based on traffic context. With this DFA paradigm, TrafficMoE maximizes representational efficiency by focusing solely on the most discriminative traffic features. Extensive experiments on six datasets demonstrate TrafficMoE consistently outperforms state-of-the-art methods, validating the necessity of heterogeneity-aware modeling in encrypted traffic analysis. The source code is publicly available at https://github.com/Posuly/TrafficMoE_main.

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

TrafficMoE: Heterogeneity-aware Mixture of Experts for Encrypted Traffic Classification

Related Papers