Search papers, labs, and topics across Lattice.
The paper introduces SAILS, a model-agnostic framework that utilizes generalized additive models (GAMs) to analyze pairwise feature interactions in machine learning models. By fitting surrogates to the local effects of black-box models, SAILS not only detects interactions but also categorizes their functional forms and provides interpretable visualizations. Empirical validation shows that SAILS effectively identifies interaction types, although it faces challenges with strong feature correlations and higher-order interactions.
SAILS reveals the functional forms of feature interactions in machine learning models, transforming how we interpret model behavior beyond mere detection.
Feature interactions drive much of the predictive power of machine learning models, yet existing explanation methods only detect and quantify interactions without revealing their functional form, or visualize only restricted interaction types. We propose Surrogate-based Analysis of Interactions via Local effect Smooths (SAILS), a model-agnostic framework that analyzes pairwise interactions through interpretable generalized additive model (GAM) surrogates fitted to the local effects of a black-box model. For each interval of a feature of interest, the surrogate smooth terms isolate the interaction components on derivative level, enabling (i) interaction detection through a heuristic derived from significance tests on smooth terms, (ii) interaction form categorization into linear, product-separable, and non-product-separable types, and (iii) tailored, interpretable visualizations for each interaction type. We empirically validate the framework through controlled simulations and a real-world task, demonstrating its effectiveness for pairwise interactions, with limitations under strong feature correlations and higher-order interactions. SAILS fills a notable gap in the XAI toolbox, going beyond detection of interactions alone to characterizing their functional form.