Search papers, labs, and topics across Lattice.
Design Conductor 2.0, a multi-agent system powered by LLMs released in April 2026, autonomously designed an LLM inference accelerator called VerTQ with hard-wired TurboQuant support. VerTQ features a 240-cycle pipeline with 5129 FP16/32 units and was mapped to an FPGA at 125 MHz, consuming 5.7 mm^2 in TSMC 16FF. This demonstrates a significant advancement in automated hardware design capabilities of AI agents, handling 80x larger tasks than its predecessor.
LLM agents can now autonomously design complex hardware like an LLM inference accelerator with hard-wired TurboQuant support in just 80 hours.
Driven by a rapid co-evolution of both harness and underlying models, LLM agents are improving at a dizzying pace. In our prior work (performed in Dec. 2025), we introduced "Design Conductor" (or just "Conductor"), a system capable of building a 5-stage Linux-capable RISC-V CPU in 12 hours. In this work, we introduce an updated multi-agent harness powered by frontier models released in April 2026, which is able to handle 80x larger tasks, at higher quality, fully autonomously. Following a brief introduction, we examine 4 designs that the system produced autonomously, including "VerTQ", an LLM inference accelerator which hard-wires support for TurboQuant in a 240-cycle pipeline, starting from the TurboQuant arXiv paper. VerTQ includes heavy compute processing, with 5129 FP16/32 units; the design was mapped to an FPGA at 125 MHz and consumes 5.7 mm^2 in TSMC 16FF (8 attention pipes). We review the key new characteristics that enabled these results. Finally, we analyze Design Conductor's token usage and other empirical characteristics, including its limitations.