Department of LinguisticsUniversity of MassachusettsUT AustinMay 21, 2026arXiv:2605.22542

Scene Abstraction for Lexical Semantics: Structured Representations of Situated Meaning

AI Summary

The paper introduces Scene Abstraction, a framework for representing the situated meaning of words by constructing structured representations of interpretive scenes using LLM few-shot prompting. These scenes consist of a Contextual Scene (Events, Entities, Setting) and an expression-centered Expression Profile (Engaged events, Generalizable properties, Evoked emotions). The authors demonstrate that these scene profiles are reliably identifiable by humans and align more closely with human interpretation than existing methods.

Key Contribution

Words like "coffee" and "tea" evoke distinct situations and emotions that current word embeddings miss, but can now be captured by prompting LLMs to generate structured "scene" representations.

Abstract

Coffee and tea share many properties, yet they evoke strikingly different situations, atmospheres, and affective associations. These situated dimensions of word meaning are real and systematic, but they remain implicit in most computational representations of lexical meaning. We propose Scene Abstraction, a framework for constructing structured representations of the interpretive scenes that words participate in across usage contexts. Each scene consists of a Contextual Scene (Events, Entities, Setting) and an expression-centered Expression Profile (Engaged events, Generalizable properties, Evoked emotions), operationalized through few-shot prompting of a large language model. Our contributions are three-fold: (1) a structured representation framework for situated lexical meaning; (2) COCA-Scenes, a dataset of 520 usage instances across 26 keywords for distinct scene identification; and (3) empirical evidence from two experiments suggesting that scenes are reliably identifiable across human observers (82.4% accuracy, +11.8 pp over text-only embeddings) and that our scene profiles more closely align with human interpretation of words in context than ATOMIC-based alternatives (86.4% preference across three semantic dimensions).

Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Scene Abstraction for Lexical Semantics: Structured Representations of Situated Meaning

Related Papers