OPPOFeb 26, 2026arXiv:2602.22944

MViR: Multi-View Visual-Semantic Representation for Fake News Detection

Haochen Liang, Haocheng Liang, Xinqi Su, Jun Wang, Chaomeng Chen, Chaomeng Chen, Zitong Yu

AI Summary

The paper introduces MViR, a novel framework for fake news detection that leverages multi-view visual-semantic representations. MViR employs pyramid dilated convolution to extract multi-view visual-semantic features from news images, which are then fused with textual information using a dedicated fusion module. Experiments on benchmark datasets demonstrate that MViR outperforms existing multimodal fake news detection methods.

Key Contribution

By explicitly modeling multi-view visual-semantic relationships, MViR achieves state-of-the-art fake news detection, suggesting that subtle image-text alignments are critical for identifying misinformation.

Abstract

With the rise of online social networks, detecting fake news accurately is essential for a healthy online environment. While existing methods have advanced multimodal fake news detection, they often neglect the multi-view visual-semantic aspects of news, such as different text perspectives of the same image. To address this, we propose a Multi-View Visual-Semantic Representation (MViR) framework. Our approach includes a Multi-View Representation module using pyramid dilated convolution to capture multi-view visual-semantic features, a Multi-View Feature Fusion module to integrate these features with text, and multiple aggregators to extract multi-view semantic cues for detection. Experiments on benchmark datasets demonstrate the superiority of MViR. The source code of FedCoop is available at https://github.com/FlowerinZDF/FakeNews-MVIR.

Computer Vision Multimodal Models Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References26

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

MViR: Multi-View Visual-Semantic Representation for Fake News Detection

Related Papers