Search papers, labs, and topics across Lattice.
This paper introduces BioStance, a novel dataset comprising 39,600 annotated Post-Comment pairs from Reddit, specifically designed for stance detection in bioethical debates. The dataset captures the nuanced, context-dependent nature of discussions surrounding six controversial bioethical topics, annotated by three independent raters with a high reliability score (Krippendorff's 伪 = 0.82). By providing a rich resource that incorporates hierarchical conversational context, BioStance facilitates advancements in context-aware stance detection and argument mining within the realm of bioethics.
BioStance reveals that nuanced bioethical discussions on social media can be effectively modeled with high reliability, challenging the limitations of existing stance detection datasets.
Bioethical debates increasingly unfold on social media, yet stance detection research lacks large-scale, domain-specific resources for modeling such context-dependent discourse. We present BioStance, a context-aware dataset of 39,600 annotated Post-Comment pairs from Reddit bioethical discussions. BioStance covers six controversial targets across three dimensions of bioethical controversy: fundamental value conflicts, individual liberty versus collective responsibility, and technological uncertainty. Each instance preserves hierarchical conversational context and is labeled by three independent annotators using a three-class stance scheme: Favor, Against, and None. The annotations achieve a mean Krippendorff's $\alpha$ of 0.82, indicating substantial reliability. By combining thematic diversity, conversational structure, and high-quality human annotation, BioStance supports research on context-aware stance detection, argument mining, and computational analysis of bioethical discourse.