Mar 31, 2026 | arXiv:2603.29723

Reinforced Reasoning for End-to-End Retrosynthetic Planning

Chenyang Zuo, Siqi Fan, Yizhen Luo, Zaiqing Nie

AI Summary

The paper introduces ReTriP, an end-to-end generative framework for retrosynthetic planning that uses Chain-of-Thought reasoning. ReTriP employs a path-coherent molecular representation and a progressive training curriculum, combining reasoning distillation with reinforcement learning using verifiable rewards. Experiments on RetroBench show ReTriP achieves state-of-the-art performance and improved robustness in long-horizon planning compared to hybrid approaches.

Key Contribution

Retrosynthetic planning, previously reliant on fragmented prediction-search hybrids, is reformulated as a unified, reasoning-driven, end-to-end generative task that achieves state-of-the-art performance on RetroBench.

Abstract

Retrosynthetic planning is a fundamental task in organic chemistry, yet remains challenging due to its combinatorial complexity. To address this, conventional approaches typically rely on hybrid frameworks that combine single-step predictions with external search heuristics, inevitably fracturing the logical coherence between local molecular transformations and global planning objectives. To bridge this gap and embed sophisticated strategic foresight directly into the model's chemical reasoning, we introduce ReTriP, an end-to-end generative framework that reformulates retrosynthesis as a direct Chain-of-Thought reasoning task. We establish a path-coherent molecular representation and employ a progressive training curriculum that transitions from reasoning distillation to reinforcement learning with verifiable rewards, effectively aligning stepwise generation with practical route utility. Empirical evaluation on RetroBench demonstrates that ReTriP achieves state-of-the-art performance, exhibiting superior robustness in long-horizon planning compared to hybrid baselines.
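The abstract does not spell out what the "verifiable rewards" are, so the following is only an illustrative sketch of one plausible check, not the authors' reward. It assumes a route is a list of (product, reactants) steps and that a route counts as solved when every leaf molecule belongs to a set of purchasable building blocks. The names `route_reward`, `Step`, and `stock` are hypothetical, and plain strings stand in for SMILES without any chemical validation.

```python
from typing import List, Set, Tuple

# One retrosynthetic step: (product, reactants). Strings stand in for SMILES;
# no chemistry validation is done here (an assumption of this sketch).
Step = Tuple[str, List[str]]


def route_reward(target: str, route: List[Step], stock: Set[str]) -> float:
    """Return 1.0 if the route decomposes `target` into stock molecules, else 0.0."""
    if not route:
        # An empty route only "solves" a target that is already purchasable.
        return 1.0 if target in stock else 0.0

    open_molecules = {target}  # molecules still waiting to be made or bought
    for product, reactants in route:
        if product not in open_molecules or not reactants:
            return 0.0  # step expands a molecule the route never needed
        open_molecules.remove(product)
        # Reactants that are not purchasable must be expanded by a later step.
        open_molecules.update(r for r in reactants if r not in stock)

    # Solved only if nothing non-purchasable is left unexpanded.
    return 1.0 if not open_molecules else 0.0
```

Because the check is purely rule-based, it can score a generated route without a learned critic. A real setup would presumably also verify that each step is a chemically valid reaction, which this sketch omits.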

Reasoning & Chain-of-Thought, Scientific Discovery & Drug Design, World Models & Planning

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
