Audio-language models can now reason about 30-minute-long audio clips with timestamp-grounded intermediate steps, unlocking a new level of fine-grained understanding.
Audio-visual LLMs (AVLLMs) may "hear" at intermediate layers, but when generating text they largely ignore audio cues in favor of vision, revealing a fundamental modality bias.