Latticethe structure behind the noise

Papers Digest Topics Selected Labs Collections FAQ

Created by Flynn Lachendro

Papers Digest Topics Labs Saved

Search

Search papers, labs, and topics across Lattice.

Built by Flynn Lachendro·𝕏 / Twitter·RSS··FAQ·Glossary·Privacy

Seonwook Park | Lattice

Seonwook Park

NVIDIA

NVIDIA Research

Papers on Lattice

1

Total citations

0

Topics

3

h-index

1

Publication activitypapers/week, last 8 weeks

Research focus

Eval Frameworks & Benchmarks (1)Multimodal Models (1)Speech & Audio (1)

Frequent co-authors

Amrita Mazumdar (1)Rajarshi Roy (1)N. Srihari (1)Shengze Wang (1)

Papers (1)

May 28, 2026

NVIDIA3d ago·also AV (audio in / cascaded avatar out)

VideoFDB: Evaluating Full-Duplex Vision-Speech Capabilities in Conversational Agents

Current vision-speech agents are surprisingly bad at mimicking the subtle, real-time audio-visual cues that make human conversation feel natural.

Amrita Mazumdar, Seonwook Park, Rajarshi Roy +6

Eval Frameworks & Benchmarks Multimodal Models Speech & Audio

Yuhao Zhou (1)

Koki Nagano (1)

Shalini De Mello (1)