Search papers, labs, and topics across Lattice.
This paper introduces a unified hidden-code recovery framework for both retrieval and restoration of deepfakes, addressing the gap in recovering tampered content for factual retrieval. The method encodes semantic and perceptual information into a compact hidden-code, refined using multi-scale vector quantization and conditional Transformers for enhanced contextual reasoning. The authors also introduce ImageNet-S, a new benchmark dataset for paired image-label factual retrieval tasks, demonstrating the method's retrieval and reconstruction capabilities across various watermarking pipelines.
Forget just detecting deepfakes – this work recovers the original image and its factual context, even after in-generation watermarking.
Recent advances in image authenticity have primarily focused on deepfake detection and localization, leaving recovery of tampered contents for factual retrieval relatively underexplored. We propose a unified hidden-code recovery framework that enables both retrieval and restoration from post-hoc and in-generation watermarking paradigms. Our method encodes semantic and perceptual information into a compact hidden-code representation, refined through multi-scale vector quantization, and enhances contextual reasoning via conditional Transformer modules. To enable systematic evaluation for natural images, we construct ImageNet-S, a benchmark that provides paired image-label factual retrieval tasks. Extensive experiments on ImageNet-S demonstrate that our method exhibits promising retrieval and reconstruction performance while remaining fully compatible with diverse watermarking pipelines. This framework establishes a foundation for general-purpose image recovery beyond detection and localization.