Search papers, labs, and topics across Lattice.
The paper introduces the task of expectation detection in natural language, motivated by the impact of patient expectations on treatment success and the potential of online forums to reveal these expectations. They create RedHOTExpect, a corpus of 4.5K Reddit posts related to medical treatments, and use a large language model for silver-labeling, achieving approximately 78% label accuracy after manual validation. Their analysis of the corpus reveals linguistic patterns characterizing expectations and differences in expectation framing between physical/treatment-related illnesses and mental health contexts.
Reddit reveals hidden patient expectations about medical treatments, showing a bias towards optimism and proactive framing in physical health discussions versus mental health.
Patients' expectations towards their treatment have a substantial effect on the treatments' success. While primarily studied in clinical settings, online patient platforms like medical subreddits may hold complementary insights: treatment expectations that patients feel unnecessary or uncomfortable to share elsewhere. Despite this, no studies examine what type of expectations users discuss online and how they express them. Presumably this is because expectations have not been studied in natural language processing (NLP) before. Therefore, we introduce the task of Expectation Detection, arguing that expectations are relevant for many applications, including opinion mining and product design. Subsequently, we present a case study for the medical domain, where expectations are particularly crucial to extract. We contribute RedHOTExpect, a corpus of Reddit posts (4.5K posts) to study expectations in this context. We use a large language model (LLM) to silver-label the data and validate its quality manually (label accuracy ~78%). Based on this, we analyze which linguistic patterns characterize expectations and explore what patients expect and why. We find that optimism and proactive framing are more pronounced in posts about physical or treatment-related illnesses compared to mental-health contexts, and that in our dataset, patients mostly discuss benefits rather than negative outcomes. The RedHOTExpect corpus can be obtained from https://www.ims.uni-stuttgart.de/data/RedHOTExpect