Search papers, labs, and topics across Lattice.
This paper introduces the COM-as-Action paradigm, which reframes professional software manipulation as deterministic program synthesis, addressing the limitations of existing GUI and API-based agents. The authors validate this approach through ComCADBench, a benchmark for agents in industrial CAD environments, revealing that COM-based execution significantly outperforms traditional GUI methods. The development of ComActor, a self-correcting agent trained through a three-stage framework, demonstrates state-of-the-art performance and resilience in long-horizon tasks, effectively bridging the gap between syntactic correctness and geometric accuracy.
COM-based execution outperforms GUI interactions, achieving state-of-the-art results in professional software manipulation tasks where traditional methods fail.
Existing computer-use agents remain fundamentally limited in professional software manipulation: GUI-based agents suffer from fragile visual grounding and long-horizon error accumulation, while API-basedapproaches struggle with heterogeneous protocols and inaccessible commercial interfaces. In this work,we identify the Component Object Model (COM) as a unified executable abstraction, proposing COM-as-Action: a new paradigm that reframes professional software interaction as deterministic program synthesisrather than sequential visual control. To validate this paradigm in the most demanding environments, weintroduce ComCADBench, the first benchmark for agents operating real industrial CAD software. Ourexperiments reveal a substantial paradigm gap: frontier proprietary models achieve near-zero successunder GUI-based interaction, whereas COM-based execution yields substantial immediate gains. Tobridge the remaining gap between syntactic correctness and geometric accuracy, we develop ComActor, aself-correcting agent trained through a progressive three-stage framework, alongside ComForge, a scalableplatform for large-scale training in Windows containers. Extensive experiments show that ComActorachieves state-of-the-art performance on ComCADBench, with strong resilience in long-horizon taskswhere baselines collapse, and generalizes to external CAD benchmark.