Search papers, labs, and topics across Lattice.
Nanjing University
2
0
3
RDMF outperforms traditional multimodal fusion methods by leveraging reaction-diffusion processes to dynamically align video and text, revealing emergent patterns that enhance moment retrieval.
Segment-level explainable forensics can drastically enhance our ability to detect and interpret localized manipulations in lengthy AI-generated videos.