Large audio-language models (LALMs) can be easily tricked into "hearing" things that aren't there, with targeted attacks succeeding at rates as high as 95%.
AudioChat tackles complex "audio stories" by using LLM-driven tool-calling agents to simulate user interactions, enabling audio foundation models to generate, edit, and understand multi-source acoustic scenes.
Seamlessly extend and morph audio clips using a diffusion model with masked latents and classifier-free guidance, achieving near-realistic results and opening new creative possibilities for sound design.
A new model, TAC, uses synthetic training data to achieve state-of-the-art audio and audio-visual reasoning by generating temporally grounded captions that can then be fed into LLMs.