Search papers, labs, and topics across Lattice.
This paper introduces a directed graph model for time-dependent text visualization, aiming to understand how effectively users interpret relationships in such visualizations. They define motifs representing possible text link patterns across time and create a controlled synthetic text generation methodology using LLMs to generate structured, fictional datasets for each pattern. A user study (n=30) revealed that identifying predefined motifs in these visualizations is challenging, highlighting complexities in both user interpretation and LLM-based synthetic data generation.
Visualizing evolving narratives with time-dependent text networks is harder than we thought: users struggle to interpret even simple motifs, and LLMs introduce unexpected complexities when generating synthetic data for evaluation.
Exponential growth in the quantity of digital news, social media, and other textual sources makes it difficult for humans to keep up with rapidly evolving narratives about world events. Various visualisation techniques have been touted to help people to understand such discourse by exposing relationships between texts (such as news articles) as topics and themes evolve over time. Arguably, the understandability of such visualisations hinges on the assumption that people will be able to easily interpret the relationships in such visual network structures. To test this assumption, we begin by defining an abstract model of time-dependent text visualisation based on directed graph structures. From this model we distill motifs that capture the set of possible ways that texts can be linked across changes in time. We also develop a controlled synthetic text generation methodology that leverages the power of modern LLMs to create fictional, yet structured sets of time-dependent texts that fit each of our patterns. Therefore, we create a clean user study environment (n=30) for participants to identify patterns that best represent a given set of synthetic articles. We find that it is a challenging task for the user to identify and recover the predefined motif. We analyse qualitative data to map an unexpectedly rich variety of user rationales when divergences from expected interpretation occur. A deeper analysis also points to unexpected complexities inherent in the formation of synthetic datasets with LLMs that undermine the study control in some cases. Furthermore, analysis of individual decision-making in our study hints at a future where text discourse visualisation may need to dispense with a one-size-fits-all approach and, instead, should be more adaptable to the specific user who is exploring the visualisation in front of them.