CASECNUGIST Guangdong | Feb 26, 2026 | arXiv:2602.22963

FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning

Zehao Li, Hongwei Yu, Hao Jiang, Qiang Sheng, Yilong Xu, Baolong Bi, Yang Li, Zhenlong Yuan, Yujun Cai, Zhaoqi Wang

AI Summary

The paper introduces FactGuard, an agentic framework for video misinformation detection that uses MLLMs to iteratively reason and selectively invoke external tools for evidence gathering. This approach addresses the limitations of fixed-depth inference and over-reliance on internal assumptions in MLLMs, especially when evidence is sparse or requires external verification. FactGuard achieves state-of-the-art performance on FakeSV, FakeTT, and FakeVV datasets, demonstrating improved robustness and generalization through a two-stage training strategy combining supervised fine-tuning and decision-aware reinforcement learning.

Key Contribution

FactGuard's agentic framework substantially boosts video misinformation detection by strategically using external tools and reinforcement learning to overcome the inherent limitations of relying solely on internal MLLM reasoning.

Abstract

Multimodal large language models (MLLMs) have substantially advanced video misinformation detection through unified multimodal reasoning, but they often rely on fixed-depth inference and place excessive trust in internally generated assumptions, particularly in scenarios where critical evidence is sparse, fragmented, or requires external verification. To address these limitations, we propose FactGuard, an agentic framework for video misinformation detection that formulates verification as an iterative reasoning process built upon MLLMs. FactGuard explicitly assesses task ambiguity and selectively invokes external tools to acquire critical evidence, enabling progressive refinement of reasoning trajectories. To further strengthen this capability, we introduce a two-stage training strategy that combines domain-specific agentic supervised fine-tuning with decision-aware reinforcement learning to optimize tool usage and calibrate risk-sensitive decision making. Extensive experiments on FakeSV, FakeTT, and FakeVV demonstrate FactGuard's state-of-the-art performance and validate its excellent robustness and generalization capacity.
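The iterative process described in the abstract (assess ambiguity, selectively call an external tool, refine, then decide) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`verify_video`, `toy_assess`, `toy_tool`, `toy_decide`), the ambiguity threshold, and the step budget are all hypothetical, and the MLLM and the external tool are replaced by toy stand-ins so the loop runs on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Trace:
    """Evidence gathered so far and the number of tool calls made."""
    evidence: List[str] = field(default_factory=list)
    tool_calls: int = 0


def verify_video(
    claim: str,
    assess: Callable[[str, List[str]], float],  # hypothetical: ambiguity score in [0, 1]
    tool: Callable[[str], str],                 # hypothetical: external evidence lookup
    decide: Callable[[str, List[str]], str],    # hypothetical: final "fake"/"real" verdict
    threshold: float = 0.5,                     # assumed cutoff, not from the paper
    max_steps: int = 3,                         # assumed budget, not from the paper
) -> Tuple[str, Trace]:
    trace = Trace()
    while trace.tool_calls < max_steps:
        # Stop gathering evidence once the case is judged clear enough.
        if assess(claim, trace.evidence) <= threshold:
            break
        # Otherwise invoke the external tool and refine with the new evidence.
        trace.evidence.append(tool(claim))
        trace.tool_calls += 1
    return decide(claim, trace.evidence), trace


# Toy stand-ins for the model and the tool (assumptions for illustration only).
def toy_assess(claim: str, evidence: List[str]) -> float:
    # Ambiguity shrinks as evidence accumulates: 1.0, 0.5, 0.33, ...
    return 1.0 / (1 + len(evidence))


def toy_tool(claim: str) -> str:
    return f"external source contradicts: {claim}"


def toy_decide(claim: str, evidence: List[str]) -> str:
    return "fake" if any("contradicts" in e for e in evidence) else "real"


label, trace = verify_video("video shows event X", toy_assess, toy_tool, toy_decide)
```

With these stand-ins the loop makes a single tool call before the ambiguity score falls to the threshold. A case that is unambiguous from the start skips the tool entirely, which is the selective behavior the abstract describes.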

Multimodal Models, Red-Teaming & Adversarial Robustness, Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 42

Year: 2026

Venue: N/A
