The paper introduces GRAPHIC, an architecture-agnostic method for analyzing neural network learning dynamics at the class level. It builds directed graphs from the confusion matrices of linear classifiers applied to intermediate-layer activations, so that tools from network science can be applied to them. The approach gives insight into class separability, dataset biases, and architectural behavior; examples include an uncovered similarity between the classes "flatfish" and "man" and the identification of labeling ambiguities.
Uncover hidden relationships in your neural network's intermediate layers by visualizing class confusion as evolving directed graphs, revealing dataset issues and architectural quirks.
Explainable artificial intelligence has emerged as a promising field of research to address reliability concerns in artificial intelligence. Despite significant progress in the field, few methods provide a systematic way to visualize and understand how classes are confused and how their relationships evolve as training progresses. In this work, we present GRAPHIC, an architecture-agnostic approach that analyzes neural networks at the class level. It leverages confusion matrices derived from intermediate layers using linear classifiers. We interpret these matrices as adjacency matrices of directed graphs, which allows tools from network science to visualize and quantify learning dynamics across training epochs and intermediate layers. GRAPHIC provides insights into linear class separability, dataset issues, and architectural behavior, revealing, for example, similarities between the classes flatfish and man, as well as labeling ambiguities validated in a human study. In summary, by uncovering real confusions, GRAPHIC offers new perspectives on how neural networks learn. The code is available at https://github.com/Johanna-S-Froehlich/GRAPHIC.
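The core idea in the abstract, reading a confusion matrix as the adjacency matrix of a directed graph, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the row normalization, the removal of the diagonal, and the toy labels are all assumptions made for illustration. In practice the predictions would come from a linear classifier fitted on the activations of one intermediate layer at one training epoch.

```python
def confusion_matrix(y_true, y_pred, n_classes):
    """C[i][j] = number of samples with true class i predicted as class j."""
    C = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        C[t][p] += 1
    return C


def confusion_graph(C, threshold=0.0):
    """Read C as a weighted directed graph (hypothetical helper).

    An edge i -> j means "class i is mistaken for class j". The weight is
    the fraction of class-i samples predicted as j. Correct predictions
    (the diagonal) are dropped; both choices are assumptions of this sketch.
    """
    edges = {}
    for i, row in enumerate(C):
        total = sum(row)
        if total == 0:
            continue  # class absent from the evaluation set
        for j, count in enumerate(row):
            weight = count / total
            if i != j and weight > threshold:
                edges[(i, j)] = weight
    return edges


def in_strength(edges, n_classes):
    """Total incoming confusion per class: how often others are mistaken for it."""
    strength = [0.0] * n_classes
    for (_, j), weight in edges.items():
        strength[j] += weight
    return strength


# Toy labels, invented for illustration: 0 = "flatfish", 1 = "man", 2 = "other".
y_true = [0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 1, 1, 1, 1, 1, 0, 2, 2, 2, 2]

C = confusion_matrix(y_true, y_pred, n_classes=3)
edges = confusion_graph(C)
strengths = in_strength(edges, n_classes=3)

for (i, j), weight in sorted(edges.items()):
    print(f"{i} -> {j}: {weight:.2f}")
```

Because the graph is directed, the edge from class 0 to class 1 can carry a different weight than the reverse edge, which a symmetric similarity measure would hide. Repeating the construction per layer and per epoch yields a sequence of graphs whose node and edge statistics can then be compared over training.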