This paper introduces an adaptive group elicitation framework that leverages LLMs and graph neural networks to efficiently query individuals for inferring population-level responses under budget constraints. The method combines an LLM-based expected information gain objective for question selection with a heterogeneous graph neural network that imputes missing responses and guides respondent selection based on observed responses and participant attributes. Experiments on three real-world opinion datasets demonstrate that the proposed approach significantly improves population-level response prediction, achieving over 12% relative gain on the Cooperative Congressional Election Study (CES) dataset with only a 10% respondent budget.
LLMs can now adaptively survey the *right* people, not just ask the *right* questions, improving population-level response prediction on CES by over 12% (relative) while querying only 10% of respondents.
Eliciting information to reduce uncertainty about latent group-level properties from surveys and other collective assessments requires allocating limited questioning effort under real costs and missing data. Although large language models enable adaptive, multi-turn interactions in natural language, most existing elicitation methods optimize what to ask with a fixed respondent pool, and do not adapt respondent selection or leverage population structure when responses are incomplete. To address this gap, we study adaptive group elicitation, a multi-round setting where an agent adaptively selects both questions and respondents under explicit query and participation budgets. We propose a theoretically grounded framework that combines (i) an LLM-based expected information gain objective for scoring candidate questions with (ii) heterogeneous graph neural network propagation that aggregates observed responses and participant attributes to impute missing responses and guide per-round respondent selection. This closed-loop procedure queries a small, informative subset of individuals while inferring population-level responses via structured similarity. Across three real-world opinion datasets, our method consistently improves population-level response prediction under constrained budgets, including a relative gain of more than 12% on CES at a 10% respondent budget.
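To make the expected information gain (EIG) idea concrete, here is a minimal sketch of scoring candidate questions by how much they are expected to reduce uncertainty about a latent state. It is an illustration under simplifying assumptions, not the paper's estimator: the latent state is a small discrete variable, the answer likelihoods are hand-written tables rather than LLM predictions, and the names `entropy`, `expected_information_gain`, `prior`, and `candidates` are hypothetical.

```python
import numpy as np


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    p = np.asarray(p, dtype=float)
    nz = p[p > 0]  # skip zero-probability entries to avoid log(0)
    return float(-(nz * np.log(nz)).sum())


def expected_information_gain(prior, likelihood):
    """EIG of one question about a discrete latent state.

    prior:      shape (S,),   p(state)
    likelihood: shape (S, A), p(answer | state) for this question
    Returns H(prior) minus the expected entropy of the posterior.
    """
    prior = np.asarray(prior, dtype=float)
    likelihood = np.asarray(likelihood, dtype=float)
    joint = prior[:, None] * likelihood  # p(state, answer)
    p_answer = joint.sum(axis=0)         # marginal p(answer)
    eig = entropy(prior)
    for a, pa in enumerate(p_answer):
        if pa > 0:
            posterior = joint[:, a] / pa  # p(state | answer = a)
            eig -= pa * entropy(posterior)
    return eig


# Toy setup (assumed values): two latent states, two possible answers.
prior = np.array([0.5, 0.5])
candidates = {
    # Answers strongly depend on the latent state.
    "informative": np.array([[0.9, 0.1], [0.1, 0.9]]),
    # Answers are independent of the latent state.
    "uninformative": np.array([[0.5, 0.5], [0.5, 0.5]]),
}

scores = {name: expected_information_gain(prior, lik)
          for name, lik in candidates.items()}
best = max(scores, key=scores.get)  # question with the highest EIG
```

In this toy setting, the question whose answers depend on the latent state receives a strictly positive score, while the question whose answers are independent of it scores approximately zero, so the selection step picks the former. Respondent selection and graph-based imputation are outside the scope of this sketch.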