Jiaqi Su

Research focus

Data Curation & Synthetic Data (1), Multimodal Models (1), Speech & Audio (1)

Frequent co-authors

Sonal Kumar (1), Prem Seetharaman (1), Oriol Nieto (1), Zhepei Wang (1)

Papers (1)

Feb 17, 2026 · also Google Research, Adobe Research, ByteDance

TAC: Timestamped Audio Captioning

A new model, TAC, uses synthetic training data to achieve state-of-the-art audio and audio-visual reasoning by generating temporally grounded captions that can then be fed into LLMs.

Sonal Kumar, Prem Seetharaman, Oriol Nieto +5

Data Curation & Synthetic Data, Multimodal Models, Speech & Audio
