IBM Research | Feb 23, 2026 | arXiv:2602.20092

BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop

Leshem Choshen, Ryan Cotterell, Mustafa Omer Gul, Jaap Jumelet, Tal Linzen, Aaron Mueller, Suchir Salhan, Raj Sanjay Shah, Alex Warstadt, Ethan Gotlieb Wilcox

AI Summary

The paper announces the 4th BabyLM workshop and competition, focusing on bridging cognitive modeling and language modeling. This year's event includes a general track for data-efficient pretraining and a new multilingual track to broaden the scope of the challenge. The call extends to papers on training efficiency, cognitive plausibility, and weak model evaluation, encouraging contributions beyond the competition itself.

Key Contribution

BabyLM 2026 seeks to push the boundaries of data-efficient and cognitively plausible language models, now with a multilingual twist.

Abstract

BabyLM aims to dissolve the boundaries between cognitive modeling and language modeling. We call both for workshop papers and for researchers to join the 4th BabyLM competition. As in previous years, we call for participants in the data-efficient pretraining challenge in the general track. This year, we also offer a new track: Multilingual. We also call for papers outside the competition in any relevant area, including training efficiency, cognitively plausible research, weak model evaluation, and more.

Data Curation & Synthetic Data | Natural Language Processing | Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 25

Year: 2026

Venue: N/A
