Mar 1, 2026arXiv:2603.01270

VoxKnesset: A Large-Scale Longitudinal Hebrew Speech Dataset for Aging Speaker Modeling

Yanir Marmor, Arad Zulti, David Krongauz, Adam Gabet, Yoad Snapir, Yair Lifshitz, Eran Segal

AI Summary

The paper introduces VoxKnesset, a new open-access dataset of ~2,300 hours of longitudinal Hebrew parliamentary speech data from 393 speakers recorded between 2009-2025. Using this dataset, the authors benchmarked the performance of WavLM-Large, ECAPA-TDNN, and Wav2Vec2-XLSR-1B on age prediction and speaker verification tasks under longitudinal conditions, revealing a degradation in speaker verification performance over time. The study demonstrates the importance of longitudinal training for age-aware speech models, showing that such models can capture meaningful temporal signals related to aging, unlike cross-sectionally trained models.

Key Contribution

Existing speech models stumble when voices age, but a new 2,300-hour longitudinal Hebrew speech dataset reveals how much performance degrades over a 15-year span, paving the way for aging-robust systems.

Abstract

Speech processing systems face a fundamental challenge: the human voice changes with age, yet few datasets support rigorous longitudinal evaluation. We introduce VoxKnesset, an open-access dataset of ~2,300 hours of Hebrew parliamentary speech spanning 2009-2025, comprising 393 speakers with recording spans of up to 15 years. Each segment includes aligned transcripts and verified demographic metadata from official parliamentary records. We benchmark modern speech embeddings (WavLM-Large, ECAPA-TDNN, Wav2Vec2-XLSR-1B) on age prediction and speaker verification under longitudinal conditions. Speaker verification EER rises from 2.15\% to 4.58\% over 15 years for the strongest model, and cross-sectionally trained age regressors fail to capture within-speaker aging, while longitudinally trained models recover a meaningful temporal signal. We publicly release the dataset and pipeline to support aging-robust speech systems and Hebrew speech processing.

Data Curation & Synthetic Data Natural Language Processing Speech & Audio

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

VoxKnesset: A Large-Scale Longitudinal Hebrew Speech Dataset for Aging Speaker Modeling

Related Papers