AudioChat tackles the complexity of "audio stories" by using LLM-driven tool-calling agents to simulate user interactions, enabling audio foundation models to generate, edit, and understand complex multi-source acoustic scenes.
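To make the tool-calling idea concrete, here is a minimal Python sketch of an LLM-driven agent loop for audio-scene tasks. The tool names, their signatures, and the `call_llm` stub are assumptions for illustration only, not AudioChat's actual interface.

```python
# Hypothetical tool-calling loop: an LLM picks an audio tool, we run it, and the
# result is fed back into the conversation. Tools and call_llm are stand-in stubs.
import json
from typing import Callable, Dict


def generate_scene(description: str) -> str:
    """Stub: ask an audio foundation model to synthesize a multi-source scene."""
    return f"audio://generated/{hash(description) & 0xFFFF}"


def edit_scene(audio_uri: str, instruction: str) -> str:
    """Stub: apply an edit (add, remove, or move a sound source) to a scene."""
    return f"{audio_uri}?edit={instruction.replace(' ', '_')}"


TOOLS: Dict[str, Callable[..., str]] = {
    "generate_scene": generate_scene,
    "edit_scene": edit_scene,
}


def call_llm(messages) -> str:
    """Stub standing in for a real LLM call that returns a JSON tool invocation."""
    return json.dumps({"tool": "generate_scene",
                       "args": {"description": "rain, distant thunder, a passing car"}})


def agent_step(messages):
    """One simulated turn: the LLM chooses a tool, we execute it, and the result
    is appended to the conversation for the next turn."""
    decision = json.loads(call_llm(messages))
    result = TOOLS[decision["tool"]](**decision["args"])
    messages.append({"role": "tool", "name": decision["tool"], "content": result})
    return messages


if __name__ == "__main__":
    history = [{"role": "user",
                "content": "Build a rainy street scene, then bring the thunder closer."}]
    print(agent_step(history)[-1])
```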
Seamlessly extend and morph audio clips using a diffusion model with masked latents and classifier-free guidance, producing near-realistic results and opening new creative possibilities for sound design.
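The sketch below illustrates the general masked-latent inpainting pattern with classifier-free guidance that the blurb describes: latents for the existing clip are re-noised and held fixed each step while the new region is denoised under guidance. The denoiser stub, the cosine schedule, and all shapes are assumptions, not the paper's actual model.

```python
# Minimal NumPy sketch of audio extension via diffusion inpainting with CFG.
import numpy as np


def extend_latents(denoiser, known, new_len, cond, steps=50, guidance=3.0, seed=0):
    """Keep the known clip's latents, generate latents for the appended region."""
    rng = np.random.default_rng(seed)
    dim = known.shape[1]
    total = known.shape[0] + new_len
    mask = np.zeros((total, 1))
    mask[: known.shape[0]] = 1.0                                    # 1 = original clip
    known_full = np.concatenate([known, np.zeros((new_len, dim))])  # pad to full length

    # Toy cosine noise schedule; abar[t] is the cumulative signal fraction at step t.
    abar = np.clip(np.cos(0.5 * np.pi * np.linspace(0, 1, steps + 1)) ** 2, 1e-4, 1.0)

    x = rng.standard_normal((total, dim))                           # start from pure noise
    for i in reversed(range(1, steps + 1)):
        a_t, a_prev = abar[i], abar[i - 1]
        # Classifier-free guidance: blend conditional and unconditional noise estimates.
        eps_c = denoiser(x, i, cond)
        eps_u = denoiser(x, i, None)
        eps = eps_u + guidance * (eps_c - eps_u)
        # Deterministic (DDIM-style) step toward the previous noise level.
        x0 = (x - np.sqrt(1.0 - a_t) * eps) / np.sqrt(a_t)
        x = np.sqrt(a_prev) * x0 + np.sqrt(1.0 - a_prev) * eps
        # Re-impose the known clip, noised to the matching level, so it stays intact.
        noised_known = (np.sqrt(a_prev) * known_full
                        + np.sqrt(1.0 - a_prev) * rng.standard_normal(x.shape))
        x = mask * noised_known + (1 - mask) * x
    return x


if __name__ == "__main__":
    toy_denoiser = lambda x, t, cond: np.zeros_like(x)   # placeholder noise predictor
    out = extend_latents(toy_denoiser, known=np.zeros((100, 64)), new_len=50, cond="wind chimes")
    print(out.shape)                                      # (150, 64)
```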
A new model, TAC, uses synthetic training data to achieve state-of-the-art audio and audio-visual reasoning by generating temporally grounded captions that can then be fed into LLMs.
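A small sketch of the caption-then-reason pattern described above: timestamped captions are serialized into a text prompt and handed to an LLM. The caption format and the `ask_llm` stub are hypothetical, not TAC's actual pipeline.

```python
# Temporally grounded captions -> LLM prompt (illustrative only).
from typing import List, Tuple


def build_prompt(captions: List[Tuple[float, float, str]], question: str) -> str:
    """Turn (start_s, end_s, caption) triples into a temporally grounded prompt."""
    lines = [f"[{start:05.1f}s - {end:05.1f}s] {text}" for start, end, text in captions]
    return ("Audio events:\n" + "\n".join(lines)
            + f"\n\nQuestion: {question}\nAnswer by reasoning over the timeline.")


def ask_llm(prompt: str) -> str:
    """Stub standing in for a call to any instruction-tuned LLM."""
    return "(model answer)"


if __name__ == "__main__":
    caps = [(0.0, 3.2, "a dog barks twice"),
            (3.2, 7.8, "a car engine starts and idles"),
            (7.8, 10.0, "the dog barks again, farther away")]
    print(ask_llm(build_prompt(caps, "Did the dog bark before or after the engine started?")))
```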