Mar 4, 2026arXiv:2603.04224

Nearest-Neighbor Density Estimation for Dependency Suppression

AI Summary

This paper introduces a variational autoencoder (VAE) framework for removing statistical dependencies between data and sensitive variables. The key innovation is a loss function based on nearest-neighbor density estimation, which directly optimizes for independence in the learned representation space. Experiments on multiple datasets show the method outperforms existing unsupervised techniques and approaches the performance of supervised methods in balancing information removal and utility.

Key Contribution

Forget adversarial training – this VAE uses nearest-neighbor density estimation to directly optimize independence, outperforming existing unsupervised methods for dependency suppression.

Abstract

The ability to remove unwanted dependencies from data is crucial in various domains, including fairness, robust learning, and privacy protection. In this work, we propose an encoder-based approach that learns a representation independent of a sensitive variable but otherwise preserving essential data characteristics. Unlike existing methods that rely on decorrelation or adversarial learning, our approach explicitly estimates and modifies the data distribution to neutralize statistical dependencies. To achieve this, we combine a specialized variational autoencoder with a novel loss function driven by non-parametric nearest-neighbor density estimation, enabling direct optimization of independence. We evaluate our approach on multiple datasets, demonstrating that it can outperform existing unsupervised techniques and even rival supervised methods in balancing information removal and utility.

Constitutional AI & AI Ethics Data Curation & Synthetic Data Red-Teaming & Adversarial Robustness

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Nearest-Neighbor Density Estimation for Dependency Suppression

Related Papers