The paper introduces TruthStance, a new dataset of 24,378 posts and 523,360 comments from Truth Social spanning 2023-2025, designed to facilitate research in argument mining and stance detection on alt-tech platforms. The authors provide a human-annotated benchmark of 1,500 instances for argument mining and claim-based stance detection, evaluating LLM prompting strategies on this benchmark. They then use the best-performing LLM configuration to generate labels for a larger portion of the dataset, releasing both the human-annotated and LLM-generated labels to enable further analysis.
TruthStance is a large-scale dataset for studying argument mining and stance detection on Truth Social, an under-studied alt-tech platform, and is intended to support research on opinion dynamics in polarized online spaces.
Argument mining and stance detection are central to understanding how opinions are formed and contested in online discourse. However, most publicly available resources focus on mainstream platforms such as Twitter and Reddit, leaving conversational structure on alt-tech platforms comparatively under-studied. We introduce TruthStance, a large-scale dataset of Truth Social conversation threads spanning 2023-2025, consisting of 24,378 posts and 523,360 comments with reply-tree structure preserved. We provide a human-annotated benchmark of 1,500 instances across argument mining and claim-based stance detection, including inter-annotator agreement, and use it to evaluate large language model (LLM) prompting strategies. Using the best-performing configuration, we release additional LLM-generated labels for 24,352 posts (argument presence) and 107,873 comments (stance to parent), enabling analysis of stance and argumentation patterns across depth, topics, and users. All code and data are released publicly.
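To make the "stance across depth" analysis concrete, the following is a minimal sketch of how a preserved reply tree could be used to tally stance labels by comment depth. The field names (`id`, `parent_id`, `stance`), the label values, and the sample records are illustrative assumptions, not the released TruthStance schema.

```python
from collections import Counter, defaultdict

# Hypothetical records: field names and stance labels are assumptions,
# not the actual TruthStance schema. "p1" stands for a root post.
comments = [
    {"id": "c1", "parent_id": "p1", "stance": "support"},
    {"id": "c2", "parent_id": "p1", "stance": "oppose"},
    {"id": "c3", "parent_id": "c1", "stance": "oppose"},
    {"id": "c4", "parent_id": "c3", "stance": "support"},
]


def comment_depths(comments):
    """Return {comment_id: depth}, where direct replies to a post have depth 1.

    Assumes the parent links form a tree (no cycles). Any parent id that is
    not itself a comment is treated as a root post at depth 0.
    """
    parent_of = {c["id"]: c["parent_id"] for c in comments}
    depths = {}
    for cid in parent_of:
        # Walk up until reaching a root post or a comment with a known depth.
        chain = []
        node = cid
        while node in parent_of and node not in depths:
            chain.append(node)
            node = parent_of[node]
        base = depths.get(node, 0)
        # Assign depths back down the chain, shallowest comment first.
        for offset, n in enumerate(reversed(chain), start=1):
            depths[n] = base + offset
    return depths


def stance_by_depth(comments):
    """Count stance labels at each reply depth."""
    depths = comment_depths(comments)
    table = defaultdict(Counter)
    for c in comments:
        table[depths[c["id"]]][c["stance"]] += 1
    return {d: dict(counts) for d, counts in sorted(table.items())}


if __name__ == "__main__":
    print(stance_by_depth(comments))
```

The depth computation is iterative rather than recursive so that long reply chains do not hit Python's recursion limit. The same per-depth table could be grouped by topic or user instead, given the corresponding fields.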