Apr 21, 2026arXiv:2604.19606

AblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories

Xue Xia, Chengkai Yao, Mingyu Tsoi, Xinjie Mao, Wenxuan Huang, Jiaqi Wei, Cheng Tan, Lang Yu, Yuejin Yang, Siqi Sun, Zhangyang Gao

AI Summary

AblateCell, a reproduce-then-ablate agent, addresses the lack of systematic ablations in AI Virtual Cell research due to under-standardized biological repositories. It reproduces reported baselines by auto-configuring environments and resolving dependencies, then performs closed-loop ablation by generating and adaptively selecting repository mutations based on performance impact and execution cost. AblateCell achieves high success rates in end-to-end workflow completion (88.9%) and accurately identifies critical components (93.3%) across three single-cell perturbation prediction repositories, outperforming human experts and heuristics.

Key Contribution

AI can now automatically reverse-engineer and rigorously validate complex biological simulations, pinpointing the key components driving performance with superhuman accuracy.

Abstract

Systematic ablations are essential to attribute performance gains in AI Virtual Cells, yet they are rarely performed because biological repositories are under-standardized and tightly coupled to domain-specific data and formats. While recent coding agents can translate ideas into implementations, they typically stop at producing code and lack a verifier that can reproduce strong baselines and rigorously test which components truly matter. We introduce AblateCell, a reproduce-then-ablate agent for virtual cell repositories that closes this verification gap. AblateCell first reproduces reported baselines end-to-end by auto-configuring environments, resolving dependency and data issues, and rerunning official evaluations while emitting verifiable artifacts. It then conducts closed-loop ablation by generating a graph of isolated repository mutations and adaptively selecting experiments under a reward that trades off performance impact and execution cost. Evaluated on three single-cell perturbation prediction repositories (CPA, GEARS, BioLORD), AblateCell achieves 88.9% (+29.9% to human expert) end-to-end workflow success and 93.3% (+53.3% to heuristic) accuracy in recovering ground-truth critical components. These results enable scalable, repository-grounded verification and attribution directly on biological codebases.

Code Generation & Program Synthesis Scientific Discovery & Drug Design Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

AblateCell: A Reproduce-then-Ablate Agent for Virtual Cell Repositories

Related Papers