This paper introduces Mixed Synthetic Nearest Neighbors (MSNN), a causal matrix completion estimator that addresses the limitations of Synthetic Nearest Neighbors (SNN) when dealing with multiple treatments and scarce data within specific treatment levels. MSNN integrates information across different treatment levels to enlarge the effective sample size, thereby improving estimation accuracy. The authors demonstrate that MSNN preserves the theoretical guarantees of SNN, including finite-sample error bounds and asymptotic normality, and validate its performance empirically on synthetic and real-world datasets.
Overcome data scarcity in causal matrix completion with multiple treatments by borrowing information across treatment levels, achieving better accuracy without sacrificing theoretical guarantees.
Synthetic Nearest Neighbors (SNN) provides a principled solution to causal matrix completion when data are missing not at random (MNAR), exploiting local low-rank structure through fully observed anchor submatrices. However, its effectiveness depends critically on sufficient data being available within each treatment level, a condition that often fails in settings with multiple or complex treatments. In this work, we propose Mixed Synthetic Nearest Neighbors (MSNN), a new entry-wise causal identification estimator that integrates information across treatment levels. We show that MSNN retains the finite-sample error bounds and asymptotic normality guarantees of SNN, while enlarging the effective sample size available for estimation. Empirical results on synthetic and real-world datasets illustrate the efficacy of the proposed approach, especially under data-scarce treatment levels.
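
To make the anchor-submatrix idea concrete, the following is a minimal sketch of an SNN-style estimate for a single missing entry within one treatment level. It is an illustration under stated assumptions, not the authors' implementation: the function name `snn_entry_estimate`, the greedy anchor selection, and the user-supplied `rank` are all hypothetical choices, and the cross-treatment mixing step that defines MSNN is not shown because the abstract does not specify it.

```python
import numpy as np

def snn_entry_estimate(Y, mask, i, j, rank):
    """Estimate Y[i, j] from a fully observed anchor submatrix (SNN-style sketch).

    Y    : 2-D array of outcomes (values at unobserved positions are ignored)
    mask : boolean array, True where Y is observed
    rank : assumed rank of the local low-rank structure (hypothetical parameter)
    """
    mask = np.asarray(mask, dtype=bool)
    n_rows, n_cols = Y.shape

    # Anchor columns: every column other than j that is observed for unit i.
    cols = np.flatnonzero(mask[i] & (np.arange(n_cols) != j))
    # Anchor rows: units other than i observed in column j AND on all anchor columns.
    # (Greedy simplification; a full method would search for a good biclique.)
    rows = np.flatnonzero(
        mask[:, j] & mask[:, cols].all(axis=1) & (np.arange(n_rows) != i)
    )
    if rows.size == 0 or cols.size == 0:
        raise ValueError("no fully observed anchor submatrix found")

    S = Y[np.ix_(rows, cols)]   # fully observed anchor submatrix
    q = Y[i, cols]              # unit i on the anchor columns
    x = Y[rows, j]              # anchor units on the target column

    # Rank-truncated pseudoinverse of S (principal component regression step).
    U, s, Vt = np.linalg.svd(S, full_matrices=False)
    k = min(rank, s.size)
    S_pinv = Vt[:k].T @ np.diag(1.0 / s[:k]) @ U[:, :k].T

    return q @ S_pinv @ x
```

In the noiseless case, with an exactly low-rank matrix and an anchor submatrix of the same rank, this expression recovers the missing entry exactly. The sketch also shows why data scarcity hurts: when few units share a treatment level, `rows` and `cols` shrink and the anchor submatrix may be too small to span the low-rank structure.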