University of OttawaMar 9, 2026arXiv:2603.08358

Do Language Models Know Theo Has a Wife? Investigating the Proviso Problem

Tara Azin, D. Dumitrescu, Daniel Dumitrescu, Diana Inkpen, Raj Singh

AI Summary

This paper introduces a diagnostic dataset and NLI task to evaluate how language models handle the proviso problem, a challenge in pragmatics concerning presupposition projection in conditional sentences. Experiments with RoBERTa, DeBERTa, LLaMA, and Gemma reveal that while models often align with human judgments, they tend to rely on surface-level patterns instead of deeper semantic or pragmatic reasoning. The study underscores the necessity of diagnostic datasets and multi-faceted evaluation methods for assessing pragmatic competence in language models.

Key Contribution

LLaMA and Gemma may seem to understand complex conditional statements, but they're really just pattern-matching, not grasping the underlying pragmatic nuances of presuppositions.

Abstract

We investigate how language models handle the proviso problem, an unresolved issue in pragmatics where presuppositions in conditional sentences diverge between theoretical and human interpretations. We reformulate this phenomenon as a Natural Language Inference task and introduce a diagnostic dataset designed to probe presupposition projection in conditionals. We evaluate RoBERTa, DeBERTa, LLaMA, and Gemma using explainability analyses. The results show that models broadly align with human judgments but rely on shallow pattern matching rather than semantic or pragmatic reasoning. Our work provides the first computational evaluation framework for the proviso problem and highlights the need for diagnostic, multi-method approaches to assess pragmatic competence and context-dependent meaning in language models.

Eval Frameworks & Benchmarks Interpretability & Mechanistic Interp Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References41

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Do Language Models Know Theo Has a Wife? Investigating the Proviso Problem

Related Papers