Apr 30, 2026arXiv:2604.27920

Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation

AI Summary

This paper benchmarks the ability of three SLMs (EuroLLM, Aya Expanse, and Gemma) to preserve fine-grained emotions during backtranslation across five European languages using the GoEmotions dataset. They find that emotion-aware prompting improves emotional preservation, and that ModernBERT performs comparably to BERT for emotion classification in this MT evaluation task. The study highlights the challenges SLMs face in maintaining affective nuance, even with targeted prompting strategies.

Key Contribution

Even with emotion-aware prompting, today's best small language models still struggle to preserve subtle emotional nuances when translating between languages.

Abstract

Preserving affective nuance remains a challenge in Machine Translation (MT), where semantic equivalence often takes precedence over emotional fidelity. This paper evaluates the performance of three state-of-the-art Small Language Models (SLMs) -- EuroLLM, Aya Expanse, and Gemma -- in maintaining fine-grained emotions during backtranslation. Using the GoEmotions dataset, which comprises Reddit comments across 28 distinct categories, we assess emotional preservation across five European languages: German, French, Spanish, Italian, and Polish. Specifically, we investigate (i) the inherent capability of these SLMs to retain emotional sentiment, (ii) the efficacy of emotion-aware prompting in improving preservation, and (iii) the performance of ModernBERT as a contemporary alternative to BERT for emotion classification in MT evaluation.

Eval Frameworks & Benchmarks Natural Language Processing Open-Source Models & Weights

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Beyond Semantics: Measuring Fine-Grained Emotion Preservation in Small Language Model-Based Machine Translation

Related Papers