PKU | Apr 19, 2026 | arXiv:2604.17288

Clover: A Neural-Symbolic Agentic Harness with Stochastic Tree-of-Thoughts for Verified RTL Repair

Zizhang Luo, Yansong Xu, Runlin Guo, Fan Cui, Kexing Zhou, Mile Xia, Hongyuan Hou, Yuhao Luo, Yun Liang

AI Summary

Clover is a neural-symbolic framework for RTL program repair. It treats repair as a structured search over code manipulations and dispatches each repair operation to either a specialized LLM agent or a symbolic solver. This targets two known weaknesses: the limited bug coverage of template-based automatic program repair, and the randomness and context corruption of LLM agents on long RTL code and waveforms. A stochastic tree-of-thoughts mechanism manages the main agent's context as a search tree to balance exploration and exploitation. On the RTL-repair benchmark, Clover fixes 96.8% of bugs within a fixed time limit, with an average pass@1 rate of 87.5%.

Key Contribution

Clover fixes 96.8% of bugs on the RTL-repair benchmark within a fixed time limit, covering 94% and 63% more bugs than pure traditional and LLM-based baselines, respectively.

Abstract

RTL program repair remains a critical bottleneck in hardware design and verification. Traditional automatic program repair (APR) methods rely on predefined templates and synthesis, which limits their bug coverage. Large language models (LLMs) and the coding agents built on them offer flexibility but suffer from randomness and context corruption when handling long RTL code and waveforms. We present Clover, a neural-symbolic agentic harness that orchestrates RTL repair as a structured search over code manipulations to find a validated fix for the bug. Recognizing that different repair operations favor distinct strategies, Clover dynamically dispatches tasks to specialized LLM agents or symbolic solvers. At its core, Clover introduces stochastic tree-of-thoughts, a test-time scaling mechanism that manages the main agent's context as a search tree, balancing exploration and exploitation for reliable outcomes. An RTL-specific toolbox further empowers agents to interact with the debugging environment. Evaluated on the RTL-repair benchmark, Clover fixes 96.8% of bugs within a fixed time limit, covering 94% and 63% more bugs than pure traditional and LLM-based baselines, respectively, while achieving an average pass@1 rate of 87.5%, demonstrating high reliability and effectiveness.
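The abstract does not spell out how stochastic tree-of-thoughts selects which branch to extend, so the following is only a minimal sketch of the general idea of a stochastic search tree that trades off exploration and exploitation. It assumes a softmax over node scores as the sampling rule, which is a common choice and not necessarily the one Clover uses. Every name here (`Node`, `softmax_pick`, `stochastic_tree_search`, and the toy bit-flip problem standing in for RTL edits and test results) is hypothetical and does not come from the paper.

```python
import math
import random
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    state: Tuple[int, ...]              # toy stand-in for a candidate repair
    score: float                        # fraction of checks passed, in [0, 1]
    parent: Optional["Node"] = None
    children: List["Node"] = field(default_factory=list)


def softmax_pick(nodes, temperature, rng):
    # Assumed sampling rule: better nodes are favored (exploitation),
    # but every node keeps a nonzero chance of being chosen (exploration).
    top = max(n.score for n in nodes)
    weights = [math.exp((n.score - top) / temperature) for n in nodes]
    return rng.choices(nodes, weights=weights, k=1)[0]


def stochastic_tree_search(root_state, propose, evaluate,
                           budget=500, temperature=0.05, seed=0):
    rng = random.Random(seed)
    root = Node(root_state, evaluate(root_state))
    if root.score >= 1.0:
        return root
    nodes = [root]
    for _ in range(budget):
        parent = softmax_pick(nodes, temperature, rng)
        child_state = propose(parent.state, rng)
        child = Node(child_state, evaluate(child_state), parent)
        parent.children.append(child)
        nodes.append(child)
        if child.score >= 1.0:          # all checks pass: validated candidate
            return child
    # Budget exhausted: fall back to the best candidate seen so far.
    return max(nodes, key=lambda n: n.score)


def path_to(node):
    # Walk parent links to recover the branch that led to this node.
    path = []
    while node is not None:
        path.append(node.state)
        node = node.parent
    return path[::-1]


# Toy problem: "repair" a bit vector until it matches a hidden target.
TARGET = (1, 0, 1, 1, 0, 1, 0, 0)


def evaluate(state):
    return sum(a == b for a, b in zip(state, TARGET)) / len(TARGET)


def propose(state, rng):
    i = rng.randrange(len(state))       # flip one randomly chosen bit
    return state[:i] + (1 - state[i],) + state[i + 1:]


best = stochastic_tree_search((0,) * len(TARGET), propose, evaluate)
print(best.score, len(path_to(best)) - 1)
```

In this sketch, lowering `temperature` makes the search greedier and raising it spreads expansions across more branches. In a real harness, `propose` would be an LLM agent or symbolic solver call and `evaluate` a simulation or formal check; both are placeholders here.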

Code Generation & Program Synthesis | Reasoning & Chain-of-Thought | Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
