DeepMindHelmholtzMax PlanckTU BerlinTU DresdenTU MunichUCLUniversity of ColoradoApr 15, 2026arXiv:2604.13883

Context Sensitivity Improves Human-Machine Visual Alignment

Frieda Born, Tom Neuhäuser, Lukas Muttenthaler, Brett D. Roads, Bernhard Spitzer, Andrew K. Lampinen, Matt Jones, Klaus-Robert Müller, Michael C. Mozer

AI Summary

This paper introduces a context-sensitive similarity computation method for neural network embeddings, drawing inspiration from human perception. The method incorporates an anchor image as context when modeling a triplet odd-one-out task. Results show up to a 15% improvement in odd-one-out accuracy compared to context-insensitive models, consistently across various vision foundation models.

Key Contribution

Human-inspired context sensitivity boosts visual reasoning in machines, closing the gap between AI and human perception.

Abstract

Modern machine learning models typically represent inputs as fixed points in a high-dimensional embedding space. While this approach has been proven powerful for a wide range of downstream tasks, it fundamentally differs from the way humans process information. Because humans are constantly adapting to their environment, they represent objects and their relationships in a highly context-sensitive manner. To address this gap, we propose a method for context-sensitive similarity computation from neural network embeddings, applied to modeling a triplet odd-one-out task with an anchor image serving as simultaneous context. Modeling context enables us to achieve up to a 15% improvement in odd-one-out accuracy over a context-insensitive model. We find that this improvement is consistent across both original and "human-aligned" vision foundation models.

Computer Vision Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Context Sensitivity Improves Human-Machine Visual Alignment

Related Papers