Lattice: the structure behind the noise

Created by Flynn Lachendro

Gantavya Bhatt

Papers on Lattice: 1
Total citations: 0
Topics: 3
h-index: 3

Research focus

Eval Frameworks & Benchmarks (1), Multimodal Models (1), Speech & Audio (1)

Frequent co-authors

Tingle Li (1), Siddharth Gururani (1), Kevin J. Shih (1), Sang-gil Lee (1)

Papers (1)

May 28, 2026

Benchmarking Single-Factor Physical Video-to-Audio Generation

V2A models prioritize text captions over visual cues when generating audio, resulting in physically plausible but often temporally misaligned sounds.

Tingle Li, Siddharth Gururani, Kevin J. Shih +6

Eval Frameworks & Benchmarks, Multimodal Models, Speech & Audio

Zhifeng Kong (1)

Arushi Goel (1)

G. Anumanchipalli (1)

Ming-Yu Liu (1)