The paper announces the 4th BabyLM workshop and competition, focusing on bridging cognitive modeling and language modeling. This year's event includes a general track for data-efficient pretraining and a new multilingual track to broaden the scope of the challenge. The call extends to papers on training efficiency, cognitive plausibility, and weak model evaluation, encouraging contributions beyond the competition itself.
BabyLM 2026 seeks to push the boundaries of data-efficient and cognitively plausible language models, now with a multilingual twist.
BabyLM aims to dissolve the boundaries between cognitive modeling and language modeling. We call both for workshop papers and for researchers to join the 4th BabyLM competition. As in previous years, we call for participants in the data-efficient pretraining challenge in the general track. This year, we also offer a new track: Multilingual. We also call for papers outside the competition in any relevant area, including training efficiency, cognitive plausibility, weak model evaluation, and more.