Search papers, labs, and topics across Lattice.
This paper proposes a framework for causal discovery that leverages the wisdom of the crowd by integrating crowdsourcing, expert knowledge elicitation, and LLM-based simulation. It frames DAG learning as a distributed decision-making task where individual agents (human experts or LLMs) possess partial knowledge of the causal graph. The key result is a systematic framework to synthesize these fragmented insights, enabling the recovery of a global causal structure that no single agent could achieve alone.
Forget solo causal discovery – a new framework shows how to combine human experts, crowdsourcing, and LLMs to unlock causal structures previously hidden from individual agents.
Learning causal structures typically represented by directed acyclic graphs (DAGs) from observational data is notoriously challenging due to the combinatorial explosion of possible graphs and inherent ambiguities in observations. This paper argues that causal learning is now ready for the emergence of a new paradigm supported by rapidly advancing technologies, fulfilling the long-standing vision of leveraging human causal knowledge. This paradigm integrates scalable crowdsourcing platforms for data collection, interactive knowledge elicitation for expert opinion modeling, robust aggregation techniques for expert reconciliation, and large language model (LLM)-based simulation for augmenting AI-driven information acquisition. In this paper, we focus on DAG learning for causal discovery and frame the problem as a distributed decision-making task, recognizing that each participant (human expert or LLM agent) possesses fragmented and imperfect knowledge about different subsets of the variables of interest in the causal graph. By proposing a systematic framework to synthesize these insights, we aim to enable the recovery of a global causal structure unachievable by any individual agent alone.We advocate for a new research frontier and outline a comprehensive framework for new research thrusts that range from eliciting, modeling, aggregating, and optimizing human causal knowledge contributions.