Jun 4, 2026arXiv:2606.05665

V2V-Bench: A Comprehensive Benchmark for Video-to-Video Generation Evaluation

Tao Liu, Leela Krishna, Gouti Pavan Kumar, Sreeja K, Vishav Garg

AI Summary

This paper introduces V2V-Bench, a comprehensive 11-dimension benchmark designed to evaluate video-to-video (V2V) generation by assessing temporal alignment, structural fidelity, transformation quality, video quality, and semantic alignment. The benchmark effectively pairs diverse source videos with complex editing tasks, allowing for a robust evaluation of two commercial models, Grok Imagine and Gemini Veo3, alongside the open-source model Open Sora 2. Results indicate that while Grok excels in editing fidelity, Veo3 outperforms in visual quality, with V2V-Bench achieving a Spearman correlation of 0.905 with human judgments across six V2V-specific dimensions.

Key Contribution

V2V-Bench reveals that leading models excel in different aspects of video generation, highlighting the nuanced strengths of Grok and Veo3 in editing fidelity and visual quality, respectively.

Abstract

Video-to-video (V2V) generation is difficult to evaluate because outputs must both follow editing instructions and preserve frame-level correspondence with the source video, which existing T2V and I2V metrics do not capture. We introduce V2V-Bench, a 11-dimension benchmark organized into five categories: temporal alignment, structural fidelity, transformation quality, video quality, and semantic alignment. V2V-Bench pairs diverse source videos with challenging editing tasks and evaluates two commercial models, Grok Imagine and Gemini Veo3, and one open-source model, Open Sora 2. Results show complementary model strengths: Grok performs better on editing fidelity, while Veo3 achieves stronger visual quality. On six V2V-specific dimensions, V2V-Bench reaches a Spearman correlation of 0.905 with human judgments.

Computer Vision Eval Frameworks & Benchmarks Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

V2V-Bench: A Comprehensive Benchmark for Video-to-Video Generation Evaluation

Related Papers