Search papers, labs, and topics across Lattice.
The paper introduces `xplainfi`, an R package built on the `mlr3` ecosystem, designed for global, loss-based feature importance analysis in machine learning models. It addresses gaps in existing R packages by implementing conditional importance methods and statistical inference procedures, including permutation feature importance, conditional feature importance, and Shapley additive global importance. The package offers a modular conditional sampling architecture and statistical inference via variance-corrected confidence intervals and the conditional predictive impact framework, providing a comprehensive toolkit for feature importance analysis and model interpretation.
Unlock robust feature importance analysis with `xplainfi`, an R package that fills critical gaps by offering conditional importance methods and statistical inference for diverse ML models.
We introduce xplainfi, an R package built on top of the mlr3 ecosystem for global, loss-based feature importance methods for machine learning models. Various feature importance methods exist in R, but significant gaps remain, particularly regarding conditional importance methods and associated statistical inference procedures. The package implements permutation feature importance, conditional feature importance, relative feature importance, leave-one-covariate-out, and generalizations thereof, and both marginal and conditional Shapley additive global importance methods. It provides a modular conditional sampling architecture based on Gaussian distributions, adversarial random forests, conditional inference trees, and knockoff-based samplers, which enable conditional importance analysis for continuous and mixed data. Statistical inference is available through multiple approaches, including variance-corrected confidence intervals and the conditional predictive impact framework. We demonstrate that xplainfi produces importance scores consistent with existing implementations across multiple simulation settings and learner types, while offering competitive runtime performance. The package is available on CRAN and provides researchers and practitioners with a comprehensive toolkit for feature importance analysis and model interpretation in R.