Sergio Gastón Burdisso

Open-weight models can now generate realistic, long-form doctor-patient conversations with corresponding SOAP notes, providing a valuable resource for training and evaluating long-context audio reasoning systems.

Yanis Labrak, David Grünert, David Grunert +20

Data Curation & Synthetic Data · Natural Language Processing · Speech & Audio

Apr 7, 2026

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR

Just 4 hours of speech data closes the modality gap in LLM-based ASR, rivaling full-dataset fine-tuning and unlocking effective domain adaptation.

Thibault Bañeras-Roux, Sergio Gastón Burdisso, Esaú Villatoro-Tello +8

Multimodal Models · Natural Language Processing · Speech & Audio

Mar 27, 2026 · also EPFL, UZH

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

LLM-based ASR can get a context boost without the compute cost: compress prior audio turns into learned latent tokens and retain transcripts to recover accuracy while shrinking the audio footprint.

Shashi Kumar, Esaú Villatoro-Tello, Sergio Gastón Burdisso +7

Inference & Quantization · Natural Language Processing · Speech & Audio

Sergio Gastón Burdisso

Publication activity (papers/week, last 8 weeks)

Research focus

Frequent co-authors

Papers (5)