Lattice: the structure behind the noise

Created by Flynn Lachendro

Esaú Villatoro-Tello | Lattice

Esaú Villatoro-Tello

Papers on Lattice: 3
Total citations: 0
Topics: 6
h-index: 18

Publication activity (papers/week, last 8 weeks)

Research focus

Speech & Audio (3) · Natural Language Processing (2) · Data Curation & Synthetic Data (1) · RLHF & Preference Learning (1)

Frequent co-authors

Shashi Kumar (3) · Hasindri Watawana (3) · Sergio Gastón Burdisso (3) · Petr Motlícek (3)

Papers (3)

Jul 9, 2026

When Synthetic Speech Is All You Have: Better Call GRPO

Reinforcement learning outperforms supervised fine-tuning in adapting ASR systems to synthetic speech, achieving a 40% reduction in word error rates.

Shashi Kumar, Yanis Labrak, Hasindri Watawana +5

Data Curation & Synthetic Data · RLHF & Preference Learning · Speech & Audio

Apr 7, 2026

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR

Just 4 hours of speech data closes the modality gap in LLM-based ASR, rivaling full-dataset fine-tuning and unlocking effective domain adaptation.

Thibault Bañeras-Roux, Sergio Gastón Burdisso, Esaú Villatoro-Tello +8

Multimodal Models · Natural Language Processing · Speech & Audio

Mar 27, 2026 · also EPFL, UZH

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR

LLM-based ASR can get a context boost without the compute cost: compress prior audio turns into learned latent tokens and retain transcripts to recover accuracy while shrinking the audio footprint.

Shashi Kumar, Esaú Villatoro-Tello, Sergio Gastón Burdisso +7

Inference & Quantization · Natural Language Processing · Speech & Audio

Research focus (continued): Multimodal Models (1) · Inference & Quantization (1)

Frequent co-authors (continued): A. Stolcke (3) · Thibault Bañeras-Roux (2) · Kadri Hacioglu (2) · Yanis Labrak (1)