Search papers, labs, and topics across Lattice.
The paper investigates why neural agents in the NeLLCom-Lex framework develop color lexicons that, while pragmatic, lack the convexity characteristic of human color categories. They address this by introducing two key modifications: upsampling rare color terms during supervised learning to improve lexical diversity, and using multi-listener reinforcement learning to promote more convex color categories. Results show that a combination of moderate upsampling and multiple listeners yields lexicons that more closely resemble human color naming systems, as measured by a convexity metric.
Human-like color categories can emerge in neural agents by simply upsampling rare color terms and training with multiple listeners, closing the gap between artificial and human color lexicons.
Modeling the emergence of human-like lexicons in computational systems has advanced through the use of interacting neural agents, which simulate both learning and communicative pressures. The NeLLCom-Lex framework (Zhang et al., 2025) allows neural agents to develop pragmatic color naming behavior and human-like lexicons through supervised learning (SL) from human data and reinforcement learning (RL) in referential games. Despite these successes, the lexicons that emerge diverge systematically from human color categories, producing highly non-convex regions in color space, which contrast with the convexity typical of human categories. To address this, we introduce two factors, upsampling rare color terms during SL and multi-listener RL interactions, and adopt a convexity measure to quantify geometric coherence. We find that upsampling improves lexical diversity and system-level informativeness of the color lexicon, while many-listener setups promote more convex color categories. The combination of moderate upsampling and multiple listeners produces lexicons most similar to human systems.