Search papers, labs, and topics across Lattice.
This paper introduces the Causal Ensemble Agent (CEA), a framework that enhances causal discovery by integrating insights from various statistical discovery experts and leveraging a Large Language Model (LLM) for dynamic expert reweighting. By employing linear opinion pooling, CEA effectively navigates the complexities of conflicting causal structures and improves the accuracy of causal graphs, especially when expert confidence is ambiguous. Experimental results on synthetic and real-world datasets reveal that CEA outperforms existing causal discovery methods, underscoring the potential of LLMs in refining causal inference processes.
Aggregating insights from diverse causal discovery experts with LLM-guided reweighting leads to significantly improved causal graph accuracy, even in ambiguous scenarios.
Causal discovery aims to uncover causal structures from observational data, which is crucial for real-world decision-making. However, different causal discovery algorithms can produce divergent results that conflict with each other, complicating the identification of accurate causal graphs. Traditional approaches rely on numerical values and statistical assumptions, often ignoring rich domain-specific information, such as feature descriptions, which could also help structure learning. While recent works explore using Large Language Models (LLMs) to infer causal relations via direct queries, such methods can be unreliable due to a lack of alignment with the actual data. To address these limitations, we propose Causal Ensemble Agent (CEA), a novel framework that aggregates structural insights from statistical discovery experts across different graph levels via linear opinion pooling, and uses an LLM as a meta-referee to dynamically reweight experts when the aggregated confidence is close to the decision boundary, thereby composing an improved and more complete causal graph. Extensive experiments on both synthetic and real-world datasets demonstrate that CEA achieves the strongest overall performance across a wide range of causal discovery methods, highlighting the effectiveness of using LLMs for meta-analysis in causal discovery.