May 6, 2026arXiv:2605.05170

Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours

The Verkor Team, Ravi Krishna, Suresh Krishna, David Chin

AI Summary

Design Conductor 2.0, a multi-agent system powered by LLMs released in April 2026, autonomously designed an LLM inference accelerator called VerTQ with hard-wired TurboQuant support. VerTQ features a 240-cycle pipeline with 5129 FP16/32 units and was mapped to an FPGA at 125 MHz, consuming 5.7 mm^2 in TSMC 16FF. This demonstrates a significant advancement in automated hardware design capabilities of AI agents, handling 80x larger tasks than its predecessor.

Key Contribution

LLM agents can now autonomously design complex hardware like an LLM inference accelerator with hard-wired TurboQuant support in just 80 hours.

Abstract

Driven by a rapid co-evolution of both harness and underlying models, LLM agents are improving at a dizzying pace. In our prior work (performed in Dec. 2025), we introduced "Design Conductor" (or just "Conductor"), a system capable of building a 5-stage Linux-capable RISC-V CPU in 12 hours. In this work, we introduce an updated multi-agent harness powered by frontier models released in April 2026, which is able to handle 80x larger tasks, at higher quality, fully autonomously. Following a brief introduction, we examine 4 designs that the system produced autonomously, including "VerTQ", an LLM inference accelerator which hard-wires support for TurboQuant in a 240-cycle pipeline, starting from the TurboQuant arXiv paper. VerTQ includes heavy compute processing, with 5129 FP16/32 units; the design was mapped to an FPGA at 125 MHz and consumes 5.7 mm^2 in TSMC 16FF (8 attention pipes). We review the key new characteristics that enabled these results. Finally, we analyze Design Conductor's token usage and other empirical characteristics, including its limitations.

Code Generation & Program Synthesis Inference & Quantization Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Design Conductor 2.0: An agent builds a TurboQuant inference accelerator in 80 hours

Related Papers