The paper introduces PerturbDiff, a diffusion-based generative model operating on distributions in a Hilbert space, to predict single-cell perturbation responses while accounting for unobserved latent factors. PerturbDiff models the shift between control and perturbed cell distributions directly, rather than mapping individual cells, to capture population-level response variability. Experiments on benchmark datasets demonstrate state-of-the-art performance in single-cell response prediction and improved generalization to unseen perturbations compared to existing methods.
By modeling distributions of cellular states with diffusion, PerturbDiff captures the hidden variability in single-cell perturbation responses that existing methods miss.
Building Virtual Cells that can accurately simulate cellular responses to perturbations is a long-standing goal in systems biology. A fundamental challenge is that high-throughput single-cell sequencing is destructive: the same cell cannot be observed both before and after a perturbation. Thus, perturbation prediction requires mapping between unpaired control and perturbed populations. Existing models address this by learning maps between distributions, but they typically assume a single fixed response distribution once conditioned on the observed cellular context (e.g., cell type) and the perturbation type. In reality, responses vary systematically due to unobservable latent factors such as microenvironmental fluctuations and complex batch effects, so the same observed conditions correspond to a manifold of possible distributions. To account for this variability, we introduce PerturbDiff, which shifts modeling from individual cells to entire distributions. By embedding distributions as points in a Hilbert space, we define a diffusion-based generative process operating directly over probability distributions. This allows PerturbDiff to capture population-level response shifts across hidden factors. Benchmarks on established datasets show that PerturbDiff achieves state-of-the-art performance in single-cell response prediction and generalizes substantially better to unseen perturbations. See our project page (https://katarinayuan.github.io/PerturbDiff-ProjectPage/); code and data will be made publicly available at https://github.com/DeepGraphLearning/PerturbDiff.
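To make the idea of treating a cell population as a single point in a Hilbert space concrete, here is a minimal sketch using kernel mean embeddings, one standard construction. The abstract does not say which embedding PerturbDiff uses, so the Gaussian kernel, the bandwidth, the synthetic data, and the helper names `rbf_kernel` and `mmd_squared` are all assumptions made for illustration, not the authors' method. The sketch computes the squared distance between two embedded populations (the maximum mean discrepancy), which should be small for two samples from the same distribution and larger when one population is shifted.

```python
import numpy as np


def rbf_kernel(x, y, bandwidth=1.0):
    """Gaussian (RBF) kernel matrix between the rows of x and the rows of y."""
    sq_dists = (
        np.sum(x ** 2, axis=1)[:, None]
        + np.sum(y ** 2, axis=1)[None, :]
        - 2.0 * x @ y.T
    )
    # Clip tiny negative values caused by floating-point cancellation.
    sq_dists = np.maximum(sq_dists, 0.0)
    return np.exp(-sq_dists / (2.0 * bandwidth ** 2))


def mmd_squared(x, y, bandwidth=1.0):
    """Squared Hilbert-space distance between the empirical kernel mean
    embeddings of two samples (biased V-statistic estimate)."""
    k_xx = rbf_kernel(x, x, bandwidth).mean()
    k_yy = rbf_kernel(y, y, bandwidth).mean()
    k_xy = rbf_kernel(x, y, bandwidth).mean()
    return k_xx + k_yy - 2.0 * k_xy


# Synthetic stand-ins for cell populations (hypothetical data, 5 features).
rng = np.random.default_rng(0)
control = rng.normal(0.0, 1.0, size=(200, 5))
control_replicate = rng.normal(0.0, 1.0, size=(200, 5))
perturbed = rng.normal(1.0, 1.0, size=(200, 5))  # mean-shifted population

same = mmd_squared(control, control_replicate, bandwidth=2.0)
shifted = mmd_squared(control, perturbed, bandwidth=2.0)
print(f"control vs. replicate: {same:.4f}")
print(f"control vs. perturbed: {shifted:.4f}")
```

In this picture each population is one vector in the Hilbert space, so a generative process over distributions can be viewed as a process over such vectors. How PerturbDiff defines its diffusion on that space is described in the paper, not here.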