Mar 29, 2026 · arXiv:2603.27855

What can LLMs tell us about the mechanisms behind polarity illusions in humans? Experiments across model scales and training steps

AI Summary

This paper investigates the emergence of two polarity illusions (the NPI illusion and the depth charge illusion) in LLMs using the Pythia suite. It finds that the NPI illusion weakens and ultimately disappears with scale, while the depth charge illusion strengthens, suggesting that "rational inference" mechanisms may not be necessary to explain these illusions in humans. The author proposes a synthesis of existing theoretical accounts, rooted in construction grammar, to explain the observed phenomena in both LLMs and humans.

Key Contribution

LLMs exhibit polarity illusions even though they cannot plausibly engage in rational inference, suggesting that shallow, "good enough" processing and/or partial grammaticalization may suffice to explain these phenomena in both machines and humans.

Abstract

I use the Pythia scaling suite (Biderman et al. 2023) to investigate if and how two well-known polarity illusions, the NPI illusion and the depth charge illusion, arise in LLMs. The NPI illusion becomes weaker and ultimately disappears as model size increases, while the depth charge illusion becomes stronger in larger models. The results have implications for human sentence processing: it may not be necessary to assume "rational inference" mechanisms that convert ill-formed sentences into well-formed ones to explain polarity illusions, given that LLMs cannot plausibly engage in this kind of reasoning, especially at the implicit level of next-token prediction. On the other hand, shallow, "good enough" processing and/or partial grammaticalization of prescriptively ungrammatical structures may both occur in LLMs. I propose a synthesis of different theoretical accounts that is rooted in the basic tenets of construction grammar.
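The abstract does not say how illusion strength is measured. One common way to probe such effects in a language model is to compare the surprisal of a critical token across matched sentence conditions, and the sketch below illustrates only that arithmetic. Everything in it is an assumption rather than the paper's method: the `NextTokenLogProbs` interface, the function names, and the uniform stub model are all hypothetical. In practice the callable would wrap a Pythia checkpoint of a given size and training step.

```python
import math
from typing import Callable, Dict, Sequence

# Hypothetical interface (not from the paper): maps a token prefix to
# natural-log probabilities over the next token.
NextTokenLogProbs = Callable[[Sequence[str]], Dict[str, float]]


def surprisal_bits(model: NextTokenLogProbs, prefix: Sequence[str], target: str) -> float:
    """Surprisal of `target` after `prefix` in bits: -log2 p(target | prefix)."""
    return -model(prefix)[target] / math.log(2)


def surprisal_difference(
    model: NextTokenLogProbs,
    prefix_a: Sequence[str],
    prefix_b: Sequence[str],
    target: str,
) -> float:
    """Surprisal of `target` after prefix_a minus its surprisal after prefix_b.

    A positive value means the target is less expected after prefix_a.
    """
    return surprisal_bits(model, prefix_a, target) - surprisal_bits(model, prefix_b, target)


def uniform_stub(vocab: Sequence[str]) -> NextTokenLogProbs:
    """Stand-in model that ignores context; used only to exercise the code."""
    logp = -math.log(len(vocab))

    def model(prefix: Sequence[str]) -> Dict[str, float]:
        return {tok: logp for tok in vocab}

    return model


if __name__ == "__main__":
    stub = uniform_stub(["a", "b", "c", "d"])
    print(surprisal_bits(stub, ["a"], "b"))
    print(surprisal_difference(stub, ["a"], ["c"], "b"))
```

Under this framing, an illusion would show up as a critical token that is nearly as unsurprising after an ungrammatical prefix as after its grammatical counterpart. Repeating the comparison across model sizes and training checkpoints would then trace how the effect changes.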

Natural Language Processing · Open-Source Models & Weights · Scaling Laws & Emergent Abilities

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
