Stanford HAI | Mar 18, 2026 | arXiv:2603.17475

Humans and transformer LMs: Abstraction drives language learning

AI Summary

This paper investigates how transformer LMs learn linguistic categories by comparing their learning trajectories to abstract feature-based and concrete exemplar-based accounts of human language acquisition. Using divergence-based metrics to track next-token distributions during GPT-2 small training, the authors find that abstract class-level behavior emerges before lexical item-specific behavior. They also observe abrupt, sequential emergence of different linguistic behaviors, suggesting abstraction is crucial for language model learning.

Key Contribution

During GPT-2 small training, abstract class-level behavior emerges before lexical item-specific behavior, paralleling abstract feature-based accounts of human language acquisition.

Abstract

Categorization is a core component of human linguistic competence. We investigate how a transformer-based language model (LM) learns linguistic categories by comparing its behaviour over the course of training to behaviours which characterize abstract feature-based and concrete exemplar-based accounts of human language acquisition. We investigate how lexical semantic and syntactic categories emerge using novel divergence-based metrics that track learning trajectories using next-token distributions. In experiments with GPT-2 small, we find that (i) when a construction is learned, abstract class-level behaviour is evident at earlier steps than lexical item-specific behaviour, and (ii) that different linguistic behaviours emerge abruptly in sequence at different points in training, revealing that abstraction plays a key role in how LMs learn. This result informs the models of human language acquisition that LMs may serve as an existence proof for.
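The abstract does not define its divergence-based metrics, so the following is only a minimal sketch of the general idea: comparing next-token distributions across training checkpoints with a divergence measure. Jensen-Shannon divergence is used here as an assumed stand-in, not as the authors' metric. The toy vocabulary, the checkpoint steps, the probabilities, and the choice of the final checkpoint as reference are all hypothetical.

```python
import math

def kl_divergence(p, q):
    """KL(p || q) in nats; terms where p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric and bounded above by ln 2."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

# Hypothetical next-token distributions over a 4-token toy vocabulary,
# all for the same context, at three made-up training checkpoints.
checkpoints = {
    100: [0.25, 0.25, 0.25, 0.25],
    1000: [0.40, 0.30, 0.20, 0.10],
    10000: [0.70, 0.20, 0.05, 0.05],
}

# Assumption: measure each checkpoint against the final one.
reference = checkpoints[10000]

# Divergence from the reference at each step: one point per checkpoint
# on a learning trajectory.
trajectory = {
    step: js_divergence(dist, reference)
    for step, dist in checkpoints.items()
}

for step, value in sorted(trajectory.items()):
    print(f"step {step:>6}: JS divergence to final = {value:.4f}")
```

In a curve like this, a sharp drop between adjacent checkpoints would correspond to the kind of abrupt emergence the abstract describes. Comparing distributions for contexts that share a category versus contexts that share a specific lexical item is one plausible way to separate class-level from item-specific behaviour, though the paper's actual procedure may differ.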

Architecture Design (Transformers, SSMs, MoE) | Natural Language Processing

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
