The paper introduces Collaborative Agent Reasoning Engineering (CARE), a structured methodology for engineering LLM agents in scientific domains that uses a three-party workflow involving SMEs, developers, and LLM-based helper agents. CARE transforms informal domain intent into structured specifications, ensuring agent behavior is specifiable, testable, and maintainable through reusable artifacts and stage-gated phases. Evaluation in a scientific use case shows CARE improves development efficiency and complex-query performance.
Forget prompt engineering – a structured methodology using LLM "helper agents" can measurably improve the efficiency and performance of LLM agents in complex scientific domains.
We present Collaborative Agent Reasoning Engineering (CARE), a disciplined methodology for engineering Large Language Model (LLM) agents in scientific domains. Unlike ad-hoc trial-and-error approaches, CARE specifies behavior, grounding, tool orchestration, and verification through reusable artifacts and systematic, stage-gated phases. The methodology employs a three-party workflow involving Subject-Matter Experts (SMEs), developers, and LLM-based helper agents. These helper agents function as facilitation infrastructure, transforming informal domain intent into structured, reviewable specifications for human approval at defined gates. CARE addresses the "jagged technological frontier", characterized by uneven LLM performance, by bridging the gap between novice and expert analysts regarding domain constraints and verification practices. By generating concrete artifacts, including interaction requirements, reasoning policies, and evaluation criteria, CARE ensures agent behavior is specifiable, testable, and maintainable. Evaluation results from a scientific use case demonstrate that this stage-gated, artifact-driven methodology yields measurable improvements in development efficiency and complex-query performance.
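The stage-gated, artifact-driven flow described above can be illustrated with a minimal sketch. It is not the authors' implementation. Every name in it (`Artifact`, `Stage`, `run_stage_gated`, `example_stages`) is hypothetical, the helper agent is a stub standing in for an LLM call, and the reviewer is a stub standing in for SME or developer sign-off. Only the three artifact kinds are taken from the abstract.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Artifact:
    """A reviewable specification produced during one phase (hypothetical shape)."""
    kind: str
    content: str
    approved: bool = False


@dataclass
class Stage:
    """One phase: a helper agent drafts an artifact, a human reviews it at the gate."""
    kind: str
    draft: Callable[[str], str]        # stand-in for an LLM helper agent
    review: Callable[[Artifact], bool]  # stand-in for SME/developer approval


def run_stage_gated(intent: str, stages: List[Stage]) -> List[Artifact]:
    """Run the stages in order, stopping at the first gate that rejects its artifact."""
    approved: List[Artifact] = []
    for stage in stages:
        artifact = Artifact(kind=stage.kind, content=stage.draft(intent))
        artifact.approved = stage.review(artifact)
        if not artifact.approved:
            raise RuntimeError(f"gate rejected artifact: {stage.kind}")
        approved.append(artifact)
    return approved


def example_stages() -> List[Stage]:
    """Build three stub stages, one per artifact kind named in the abstract."""
    kinds = ["interaction requirements", "reasoning policies", "evaluation criteria"]

    def make_drafter(kind: str) -> Callable[[str], str]:
        # Stub helper agent: wraps the informal intent in a labelled draft.
        return lambda intent: f"[{kind}] derived from: {intent}"

    def approve_if_nonempty(artifact: Artifact) -> bool:
        # Stub reviewer: approves any non-empty draft.
        return bool(artifact.content.strip())

    return [Stage(kind, make_drafter(kind), approve_if_nonempty) for kind in kinds]
```

Calling `run_stage_gated("answer queries about instrument data", example_stages())` would return three approved artifacts in stage order. Replacing any reviewer with one that returns `False` halts the run at that gate, which is the property that makes the approval gates meaningful in this sketch.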