DAMO; HIT; Humanoid Robot (Shanghai) Co.; School of Informatics; Soochow

Apr 20, 2026

arXiv:2604.17972

Modeling Multiple Support Strategies within a Single Turn for Emotional Support Conversations

Jie Zhu, Huaixia Dou, Junhui Li, Lifan Guo, Feng Chen, Jinsong Su, Chi Zhang, Fang Kong

AI Summary

This paper revisits the Emotional Support Conversation (ESC) task by formulating it as multi-strategy utterance generation, on the observation that real-world supportive dialogue often combines several strategies within a single turn. It proposes two generation methods, All-in-One and One-by-One, and enhances both with cognitive reasoning guided by reinforcement learning to improve strategy selection and response composition. Experiments on the ESConv dataset, under both utterance-level and dialogue-level settings, indicate that the methods model multi-strategy utterances effectively and improve supportive quality and dialogue success.

Key Contribution

Systematic empirical evidence that allowing multiple support strategies within a single utterance is both feasible and beneficial for emotional support conversations, with improved supportive quality and dialogue success on ESConv.

Abstract

Emotional Support Conversation (ESC) aims to assist individuals experiencing distress by generating empathetic and supportive dialogue. While prior work typically assumes that each supporter turn corresponds to a single strategy, real-world supportive communication often involves multiple strategies within a single utterance. In this paper, we revisit the ESC task by formulating it as multi-strategy utterance generation, where each utterance may contain one or more strategy-response pairs. We propose two generation methods: All-in-One, which predicts all strategy-response pairs in a single decoding step, and One-by-One, which iteratively generates strategy-response pairs until completion. Both methods are further enhanced with cognitive reasoning guided by reinforcement learning to improve strategy selection and response composition. We evaluate our models on the ESConv dataset under both utterance-level and dialogue-level settings. Experimental results show that our methods effectively model multi-strategy utterances and lead to improved supportive quality and dialogue success. To our knowledge, this work provides the first systematic empirical evidence that allowing multiple support strategies within a single utterance is both feasible and beneficial for emotional support conversations. All code and data will be publicly available at https://github.com/aliyun/qwen-dianjin.
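The difference between the two generation methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `StrategyResponse` type, the function names, the `max_pairs` cap, and the stub models are all hypothetical, and the strategy labels are illustrative examples only. In practice the callables would wrap a trained language model.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class StrategyResponse:
    """One strategy-response pair inside a supporter utterance (hypothetical type)."""
    strategy: str
    response: str


# Hypothetical model interfaces: a real system would call a language model here.
AllInOneModel = Callable[[List[str]], List[StrategyResponse]]
NextPairModel = Callable[[List[str], List[StrategyResponse]], Optional[StrategyResponse]]


def all_in_one(context: List[str], model: AllInOneModel) -> List[StrategyResponse]:
    """All-in-One: a single call yields every pair of the utterance."""
    return model(context)


def one_by_one(
    context: List[str], next_pair: NextPairModel, max_pairs: int = 4
) -> List[StrategyResponse]:
    """One-by-One: generate pairs iteratively until the model signals completion.

    `max_pairs` is an assumed safety cap, not something the abstract specifies.
    """
    pairs: List[StrategyResponse] = []
    while len(pairs) < max_pairs:
        pair = next_pair(context, pairs)  # conditioned on the pairs produced so far
        if pair is None:                  # None stands in for a stop signal
            break
        pairs.append(pair)
    return pairs


def render(pairs: List[StrategyResponse]) -> str:
    """Join the responses of all pairs into one supporter utterance."""
    return " ".join(p.response for p in pairs)


# Stub models with a fixed plan, for illustration only.
_PLAN = [
    StrategyResponse("Reflection of feelings", "That sounds really discouraging."),
    StrategyResponse("Providing Suggestions", "Could you ask to review the exam?"),
]


def stub_all_in_one(context: List[str]) -> List[StrategyResponse]:
    return list(_PLAN)


def stub_next_pair(
    context: List[str], so_far: List[StrategyResponse]
) -> Optional[StrategyResponse]:
    return _PLAN[len(so_far)] if len(so_far) < len(_PLAN) else None
```

With the stubs, `all_in_one(ctx, stub_all_in_one)` and `one_by_one(ctx, stub_next_pair)` return the same two pairs. The sketch only shows the control flow: one decoding step in the first case, and a loop that conditions each step on the pairs already generated in the second.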

Natural Language Processing

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
