Côte d'Azur | Feb 19, 2026 | arXiv:2602.17467

PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

Greta Damo, Stéphane Petiot, Serena Villata

AI Summary

The paper introduces PEACE 2.0, a tool designed to analyze hate speech, provide grounded explanations for its classification, and generate counter-speech responses. It leverages a Retrieval-Augmented Generation (RAG) pipeline to ground hate speech explanations in evidence and facts, and to automatically generate evidence-grounded counter-speech. The tool also explores characteristics of counter-speech replies, addressing both explicit and implicit hateful messages.

Key Contribution

PEACE 2.0 goes beyond flagging hate speech: it uses a Retrieval-Augmented Generation (RAG) pipeline to ground its explanations in retrieved evidence and to generate evidence-backed counter-speech.

Abstract

The increasing volume of hate speech on online platforms poses significant societal challenges. While the Natural Language Processing community has developed effective methods to automatically detect the presence of hate speech, generating responses to it, called counter-speech, is still an open challenge. We present PEACE 2.0, a novel tool that, besides analysing and explaining why a message is considered hateful or not, also generates a response to it. More specifically, PEACE 2.0 has three main new functionalities: i) grounding hate speech (HS) explanations in evidence and facts through a Retrieval-Augmented Generation (RAG) pipeline, ii) automatically generating evidence-grounded counter-speech through the same pipeline, and iii) exploring the characteristics of counter-speech replies. By integrating these capabilities, PEACE 2.0 enables in-depth analysis and response generation for both explicit and implicit hateful messages.
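
The retrieve-then-generate pattern described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the retriever, the evidence source, or the prompts PEACE 2.0 uses, so everything here is an assumption: the bag-of-words cosine retriever stands in for whatever retriever the tool employs, the function names `retrieve` and `build_prompt` are hypothetical, the passages are placeholders, and no language model is called.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(message, passages, k=2):
    """Toy retriever (an assumption, not the paper's): top-k passages by word overlap."""
    query = Counter(tokenize(message))
    scored = [(cosine(query, Counter(tokenize(p))), p) for p in passages]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Drop passages that share no words with the message.
    return [p for score, p in scored[:k] if score > 0]


def build_prompt(message, evidence, task):
    """Assemble a grounded prompt; `task` is 'explanation' or 'counter-speech'."""
    if task not in ("explanation", "counter-speech"):
        raise ValueError("task must be 'explanation' or 'counter-speech'")
    lines = [
        f"Task: write a grounded {task} for the message below.",
        f"Message: {message}",
        "Evidence:",
    ]
    lines += [f"[{i}] {p}" for i, p in enumerate(evidence, start=1)]
    lines.append("Use only the evidence above and cite it by number.")
    return "\n".join(lines)


# Placeholder evidence store; a real system would index vetted sources.
PASSAGES = [
    "placeholder passage about employment figures",
    "placeholder passage about public health guidance",
    "placeholder passage about housing policy",
]

message = "a message about employment"
evidence = retrieve(message, PASSAGES, k=1)
prompt = build_prompt(message, evidence, "counter-speech")
```

In this sketch the same retrieved evidence can feed either task by switching the `task` argument, which mirrors the abstract's point that one pipeline grounds both the explanation and the counter-speech. The resulting `prompt` string would then be passed to a generator model of the reader's choice.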

Constitutional AI & AI Ethics, Natural Language Processing

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
