Search papers, labs, and topics across Lattice.
A benchmark for evaluating large language models (LLMs) on multi-step geospatial tasks relevant to commercial GIS practitioners is established, and an LLM-as-Judge evaluation framework is developed to compare agent solutions against reference implementations.