The paper introduces Experiment Automation Agents (EAA), an agentic system driven by a vision-language model (VLM) that automates experimental microscopy workflows, with the aim of making beamline experiments more efficient and more accessible. EAA combines multimodal reasoning, tool-augmented actions, and optional long-term memory within a flexible task-manager architecture, supporting both fully autonomous and user-guided experimental procedures. The system is demonstrated at an imaging beamline at the Advanced Photon Source on tasks such as zone plate focusing and feature search, illustrating how such agents can improve beamline efficiency and lower the expertise barrier for users.
Imagine automating complex microscopy experiments with a vision-language model agent that follows natural-language instructions and operates instruments through control tools.
We present Experiment Automation Agents (EAA), a vision-language-model-driven agentic system designed to automate complex experimental microscopy workflows. EAA integrates multimodal reasoning, tool-augmented action, and optional long-term memory to support both autonomous procedures and interactive user-guided measurements. Built on a flexible task-manager architecture, the system enables workflows ranging from fully agent-driven automation to logic-defined routines that embed localized LLM queries. EAA further provides a modern tool ecosystem with two-way compatibility with the Model Context Protocol (MCP), allowing instrument-control tools to be consumed or served across applications. We demonstrate EAA at an imaging beamline at the Advanced Photon Source, including automated zone plate focusing, feature search driven by natural-language descriptions, and interactive data acquisition. These results illustrate how vision-capable agents can enhance beamline efficiency, reduce operational burden, and lower the expertise barrier for users.
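As a rough illustration of what a "logic-defined routine that embeds a localized LLM query" could look like, the sketch below hard-codes a focus scan and delegates a single decision to a model callable. It is not EAA's actual API: `ToolRegistry`, `SimulatedStage`, `stub_model_query`, and `focus_routine` are hypothetical names, the instrument is simulated, and the model query is replaced by a deterministic stub so the example runs without any external service.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ToolRegistry:
    """Hypothetical registry mapping tool names to instrument-control callables."""
    tools: Dict[str, Callable[..., float]] = field(default_factory=dict)

    def register(self, name: str, fn: Callable[..., float]) -> None:
        self.tools[name] = fn

    def call(self, name: str, **kwargs) -> float:
        return self.tools[name](**kwargs)


class SimulatedStage:
    """Stand-in for a real instrument; best focus is assumed at z = 2.0 (arbitrary units)."""

    def __init__(self) -> None:
        self.z = 0.0

    def move_z(self, z: float) -> float:
        self.z = z
        return self.z

    def acquire_sharpness(self) -> float:
        # Synthetic sharpness metric that peaks at z = 2.0.
        return 1.0 / (1.0 + (self.z - 2.0) ** 2)


def stub_model_query(prompt: str, scores: List[float]) -> int:
    """Placeholder for a localized model query: returns the index of the best candidate.

    A real system would send the prompt (and images) to a vision-language model here.
    """
    return max(range(len(scores)), key=scores.__getitem__)


def focus_routine(
    registry: ToolRegistry,
    candidates: List[float],
    query: Callable[[str, List[float]], int],
) -> float:
    """Fixed scan logic with one embedded model decision."""
    scores = []
    for z in candidates:
        registry.call("move_z", z=z)
        scores.append(registry.call("acquire_sharpness"))
    best = query("Which scan position looks sharpest?", scores)
    registry.call("move_z", z=candidates[best])
    return candidates[best]


stage = SimulatedStage()
registry = ToolRegistry()
registry.register("move_z", stage.move_z)
registry.register("acquire_sharpness", stage.acquire_sharpness)
best_z = focus_routine(registry, [0.0, 1.0, 2.0, 3.0], stub_model_query)
```

In this sketch the scan order and the final move are ordinary program logic, and only the choice of the best position is handed to the model. A fully agent-driven workflow would instead let the model decide which registered tools to call and in what order.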