University of Maryland
Generate semantically aligned, high-fidelity music for videos with unprecedented speed and control by combining autoregressive planning and diffusion.
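The summary names the key design split (an autoregressive stage that plans, then a diffusion stage that synthesizes), which a toy sketch can make concrete. Below is a minimal, hypothetical PyTorch version; every module, shape, and schedule is an illustrative assumption, not the paper's architecture.

```python
import torch
import torch.nn as nn

class ARPlanner(nn.Module):
    """Stage 1 (assumed interface): autoregressively emit coarse
    semantic 'plan' tokens from a pooled video feature."""
    def __init__(self, vocab=512, dim=128, video_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.rnn = nn.GRU(dim + video_dim, dim, batch_first=True)
        self.head = nn.Linear(dim, vocab)

    @torch.no_grad()
    def plan(self, video, steps=16):
        tok = torch.zeros(video.size(0), 1, dtype=torch.long)  # BOS = 0
        h, out = None, []
        for _ in range(steps):
            x = torch.cat([self.embed(tok), video.unsqueeze(1)], dim=-1)
            y, h = self.rnn(x, h)
            tok = self.head(y[:, -1]).argmax(-1, keepdim=True)  # greedy decode
            out.append(tok)
        return torch.cat(out, dim=1)  # (B, steps) coarse plan tokens

class PlanConditionedDenoiser(nn.Module):
    """Stage 2 (assumed interface): predict the noise in an audio latent,
    conditioned on the plan embedding and the diffusion timestep."""
    def __init__(self, latent_dim=64, plan_dim=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim + plan_dim + 1, 256), nn.SiLU(),
            nn.Linear(256, latent_dim))

    def forward(self, z_t, plan_emb, t):
        t_feat = t.float().unsqueeze(-1) / 1000.0  # scalar timestep feature
        return self.net(torch.cat([z_t, plan_emb, t_feat], dim=-1))

@torch.no_grad()
def sample(denoiser, plan_emb, steps=50, latent_dim=64):
    """Standard DDPM reverse loop with a linear beta schedule."""
    z = torch.randn(plan_emb.size(0), latent_dim)
    betas = torch.linspace(1e-4, 0.02, steps)
    alphas, alpha_bars = 1 - betas, torch.cumprod(1 - betas, dim=0)
    for i in reversed(range(steps)):
        t = torch.full((plan_emb.size(0),), i)
        eps = denoiser(z, plan_emb, t)
        z = (z - (1 - alphas[i]) / torch.sqrt(1 - alpha_bars[i]) * eps) \
            / torch.sqrt(alphas[i])
        if i > 0:
            z = z + torch.sqrt(betas[i]) * torch.randn_like(z)
    return z

# Wire the stages together: plan autoregressively, then diffuse.
video = torch.randn(2, 128)                   # pooled video features
planner, denoiser = ARPlanner(), PlanConditionedDenoiser()
plan_emb = planner.embed(planner.plan(video)).mean(dim=1)
audio_latents = sample(denoiser, plan_emb)    # (2, 64), to be decoded to audio
```

The split is the point: the cheap autoregressive pass fixes the coarse semantic trajectory, so the diffusion pass only has to fill in high-fidelity detail under that plan.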
Audio-language models can now reason about 30-minute-long audio clips with timestamp-grounded intermediate steps, unlocking a new level of fine-grained understanding.
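The notable part is the output format: each intermediate reasoning step cites the audio span it relies on. Purely as an illustrative sketch (the schema and field names below are assumptions, not the paper's), such a trace might look like:

```python
# Hypothetical schema for a timestamp-grounded reasoning trace over a
# 30-minute clip; spans are (start_s, end_s) in seconds.
trace = [
    {"step": "Speaker A poses a question about the budget.",
     "span_s": (12.0, 48.5)},        # grounded in 0:12-0:48
    {"step": "A musical interlude separates the two segments.",
     "span_s": (305.2, 331.0)},
    {"step": "Speaker B answers the question from step 1.",
     "span_s": (1502.7, 1533.4)},    # grounded near 25:03
]
answer = "The question at ~0:12 is resolved around 25:03."
```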
Current multimodal models are surprisingly bad at understanding long, complex videos, struggling to integrate audio, visual, and text cues even for basic reasoning tasks.
Forget HRTFs: a differentiable multi-sphere scattering model inspired by underwater animal acoustics offers a new foundation for spatial audio processing and localization.
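A faithful multi-sphere scattering implementation is beyond a blurb, but the underlying idea of a differentiable forward acoustic model can be shown with a deliberately simpler stand-in: a free-field delay-and-attenuation model in PyTorch, with the source position recovered by gradient descent through it. All constants and shapes below are assumptions for illustration, not the paper's method.

```python
import torch

C = 343.0                                  # speed of sound (m/s)
freqs = torch.linspace(100.0, 400.0, 16)   # low freqs keep the loss smoother
omega = 2 * torch.pi * freqs
mics = torch.tensor([[0.0, 0.0], [0.3, 0.0], [0.0, 0.3], [0.3, 0.3]])

def forward_model(src):
    """Complex pressure at each mic: 1/r spreading plus phase delay.
    (A free-field stand-in, NOT the paper's multi-sphere scattering.)"""
    d = torch.linalg.norm(mics - src, dim=1)                 # (4,) distances
    return torch.exp(-1j * omega[None, :] * d[:, None] / C) / d[:, None]

# Simulate observations from a hidden source, then localize it by
# backpropagating the mismatch through the forward model.
true_src = torch.tensor([1.2, 0.7])
obs = forward_model(true_src)

est = torch.tensor([0.8, 0.4], requires_grad=True)  # a decent init matters:
opt = torch.optim.Adam([est], lr=0.02)              # phase terms are multimodal
for _ in range(500):
    opt.zero_grad()
    loss = (forward_model(est) - obs).abs().pow(2).sum()
    loss.backward()
    opt.step()
print(est.detach())  # moves toward tensor([1.2, 0.7])
```

Swap `forward_model` for a richer scatterer and the same recipe applies, which is what makes a fully differentiable acoustic model attractive as a foundation for localization.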