Search papers, labs, and topics across Lattice.
This paper introduces a pipeline to extract causal relationships as perceived by LLMs, rather than ground truth causality, by sampling documents from the LLM on a topic, extracting and grouping events into canonical forms, and then applying causal discovery methods to construct candidate causal graphs. The approach aims to provide an inspectable set of causal hypotheses that LLMs plausibly assume, offering insights into the models' internal worldviews. Experiments demonstrate the feasibility of the pipeline in eliciting these relationships.
Uncover your LLM's hidden worldviews: this pipeline extracts and visualizes the causal relationships a language model believes to be true.
Large language models (LLMs) are trained on enormous amounts of data and encode knowledge in their parameters. We propose a pipeline to elicit causal relationships from LLMs. Specifically, (i) we sample many documents from LLMs on a given topic, (ii) we extract an event list from from each document, (iii) we group events that appear across documents into canonical events, (iv) we construct a binary indicator vector for each document over canonical events, and (v) we estimate candidate causal graphs using causal discovery methods. Our approach does not guarantee real-world causality. Rather, it provides a framework for presenting the set of causal hypotheses that LLMs can plausibly assume, as an inspectable set of variables and candidate graphs.