Search papers, labs, and topics across Lattice.
The paper introduces MViR, a novel framework for fake news detection that leverages multi-view visual-semantic representations. MViR employs pyramid dilated convolution to extract multi-view visual-semantic features from news images, which are then fused with textual information using a dedicated fusion module. Experiments on benchmark datasets demonstrate that MViR outperforms existing multimodal fake news detection methods.
By explicitly modeling multi-view visual-semantic relationships, MViR achieves state-of-the-art fake news detection, suggesting that subtle image-text alignments are critical for identifying misinformation.
With the rise of online social networks, detecting fake news accurately is essential for a healthy online environment. While existing methods have advanced multimodal fake news detection, they often neglect the multi-view visual-semantic aspects of news, such as different text perspectives of the same image. To address this, we propose a Multi-View Visual-Semantic Representation (MViR) framework. Our approach includes a Multi-View Representation module using pyramid dilated convolution to capture multi-view visual-semantic features, a Multi-View Feature Fusion module to integrate these features with text, and multiple aggregators to extract multi-view semantic cues for detection. Experiments on benchmark datasets demonstrate the superiority of MViR. The source code of FedCoop is available at https://github.com/FlowerinZDF/FakeNews-MVIR.