Search papers, labs, and topics across Lattice.
The paper introduces DISCOVER, a model-agnostic solver for Distributional Counterfactual Explanations (DCE) that addresses the limitations of gradient-based optimization in tabular pipelines with non-differentiable models. DISCOVER preserves the DCE objective of balancing proximity to a factual input distribution and alignment to a target output distribution while replacing gradient descent with a sparse propose-and-select search paradigm guided by OT-guided cone sampling. Experiments on tabular datasets demonstrate DISCOVER's ability to achieve strong alignment of input and output distributions, extending DCE to black-box learning pipelines.
Distributional counterfactual explanations are now possible for black-box tabular models, thanks to a novel sparse search algorithm that sidesteps the need for gradients.
Counterfactual explanations (CE) explain model decisions by identifying input modifications that lead to different predictions. Most existing methods operate at the instance level. Distributional Counterfactual Explanations (DCE) extend this setting by optimizing an optimal transport objective that balances proximity to a factual input distribution and alignment to a target output distribution, with statistical certification via chance constrained bounds. However, DCE relies on gradient based optimization, while many real-world tabular pipelines are dominated by non-differentiable models. We propose DISCOVER, a model-agnostic solver for distributional counterfactual explanations. DISCOVER preserves the original DCE objective and certification while replacing gradient descent with a sparse propose-and-select search paradigm. It exploits a sample-wise decomposition of the transport objective to compute per-row impact scores and enforce a top-$k$ intervention budget, focusing edits on the most influential samples. To guide candidate generation without predictor gradients, DISCOVER introduces an OT-guided cone sampling primitive driven by input-side transport geometry. Experiments on multiple tabular datasets demonstrate strong joint alignment of input and output distributions, extending distributional counterfactual reasoning to modern black box learning pipelines. A code repository is available at https://github.com/understanding-ml/DCE.