Apr 23, 2026arXiv:2604.21909

Directional Confusions Reveal Divergent Inductive Biases Through Rate-Distortion Geometry in Human and Machine Vision

AI Summary

This paper analyzes directional confusions between humans and deep vision models on a natural image categorization task under various perturbations, revealing divergent inductive biases. The authors quantify asymmetry in confusion matrices using a Rate-Distortion (RD) framework, characterized by geometric signatures (slope, curvature) and efficiency (AUC). They find that humans exhibit broad, weak asymmetries, while deep vision models show sparse, strong directional collapses, highlighting differences in generalization geometry despite similar classification accuracy.

Key Contribution

Despite achieving comparable accuracy, humans and deep vision models exhibit fundamentally different error patterns, revealing distinct inductive biases that can be quantified through directional confusion analysis and Rate-Distortion geometry.

Abstract

Humans and modern vision models can reach similar classification accuracy while making systematically different kinds of mistakes - differing not in how often they err, but in who gets mistaken for whom, and in which direction. We show that these directional confusions reveal distinct inductive biases that are invisible to accuracy alone. Using matched human and deep vision model responses on a natural-image categorization task under 12 perturbation types, we quantify asymmetry in confusion matrices and link it to generalization geometry through a Rate-Distortion (RD) framework, summarized by three geometric signatures (slope (beta), curvature (kappa)) and efficiency (AUC). We find that humans exhibit broad but weak asymmetries, whereas deep vision models show sparser, stronger directional collapses. Robustness training reduces global asymmetry but fails to recover the human-like breadth-strength profile of graded similarity. Mechanistic simulations further show that different asymmetry organizations shift the RD frontier in opposite directions, even when matched for performance. Together, these results position directional confusions and RD geometry as compact, interpretable signatures of inductive bias under distribution shift.

Computer Vision Interpretability & Mechanistic Interp

Citation Metrics

Citations0

Influential citations0

References35

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Directional Confusions Reveal Divergent Inductive Biases Through Rate-Distortion Geometry in Human and Machine Vision

Related Papers