Mar 16, 2026arXiv:2603.15168

Multimodal Connectome Fusion via Cross-Attention for Autism Spectrum Disorder Classification Using Graph Learning

Ansar Rahman, Hassan Shojaee-Mend, Sepideh Hatamikia

AI Summary

This paper introduces a multimodal graph learning framework for Autism Spectrum Disorder (ASD) classification that integrates functional and structural MRI data along with phenotypic information. The framework uses Edge Variational GCNs to learn subject-level embeddings from functional and structural features, and introduces an asymmetric transformer-based cross-attention mechanism to fuse these embeddings, prioritizing functional connectivity. Evaluated on the ABIDE-I dataset, the framework achieves state-of-the-art ASD classification performance, with an AUC of 87.3% and accuracy of 84.4% under 10-fold cross-validation, and 82.0% accuracy under leave-one-site-out cross-validation.

Key Contribution

By fusing functional and structural MRI data with a novel cross-attention mechanism, this model boosts ASD classification accuracy by up to 7% compared to existing methods, especially in cross-site validation.

Abstract

Autism spectrum disorder (ASD) is a complex neurodevelopmental condition characterized by atypical functional brain connectivity and subtle structural alterations. rs-fMRI has been widely used to identify disruptions in large-scale brain networks, while structural MRI provides complementary information about morphological organization. Despite their complementary nature, effectively integrating these heterogeneous imaging modalities within a unified framework remains challenging. This study proposes a multimodal graph learning framework that preserves the dominant role of functional connectivity while integrating structural imaging and phenotypic information for ASD classification. The proposed framework is evaluated on ABIDE-I dataset. Each subject is represented as a node within a population graph. Functional and structural features are extracted as modality-specific node attributes, while inter-subject relationships are modeled using a pairwise association encoder (PAE) based on phenotypic information. Two Edge Variational GCNs are trained to learn subject-level embeddings. To enable effective multimodal integration, we introduce a novel asymmetric transformer-based cross-attention mechanism that allows functional embeddings to selectively incorporate complementary structural information while preserving functional dominance. The fused embeddings are then passed to a MLP for ASD classification. Using stratified 10-fold cross-validation, the framework achieved an AUC of 87.3% and an accuracy of 84.4%. Under leave-one-site-out cross-validation (LOSO-CV), the model achieved an average cross-site accuracy of 82.0%, outperforming existing methods by approximately 3% under 10-fold cross-validation and 7% under LOSO-CV. The proposed framework effectively integrates heterogeneous multimodal data from the multi-site ABIDE-I dataset, improving automated ASD classification across imaging sites.

Multimodal Models Scientific Discovery & Drug Design

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Multimodal Connectome Fusion via Cross-Attention for Autism Spectrum Disorder Classification Using Graph Learning

Related Papers