Mar 18, 2026arXiv:2603.17952

Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures

Chiara Manna, Hosein Mohebbi, A. Alishahi, Afra Alishahi, Frédéric Blain, Eva Vanmassenhove

AI Summary

This paper introduces "Prior Bias," a new metric to evaluate default gender assumptions in machine translation models, and applies it to decoder-only architectures. They find that decoder-only models don't consistently outperform encoder-decoder models on gender bias metrics, despite their scale. However, instruction tuning improves contextual awareness and reduces masculine prior bias in these models.

Key Contribution

Instruction tuning can reduce masculine bias in decoder-only MT models, but these models still don't consistently outperform encoder-decoder architectures on gender-specific translation tasks.

Abstract

While Large Language Models achieve state-of-the-art results across a wide range of NLP tasks, they remain prone to systematic biases. Among these, gender bias is particularly salient in MT, due to systematic differences across languages in whether and how gender is marked. As a result, translation often requires disambiguating implicit source signals into explicit gender-marked forms. In this context, standard benchmarks may capture broad disparities but fail to reflect the full complexity of gender bias in modern MT. In this paper, we extend recent frameworks on bias evaluation by: (i) introducing a novel measure coined"Prior Bias", capturing a model's default gender assumptions, and (ii) applying the framework to decoder-only MT models. Our results show that, despite their scale and state-of-the-art status, decoder-only models do not generally outperform encoder-decoder architectures on gender-specific metrics; however, post-training (e.g., instruction tuning) not only improves contextual awareness but also reduces the masculine Prior Bias.

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References74

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures

Related Papers