May 26, 2026arXiv:2605.27654

Cultural Fidelity in English-to-Hindi Translation: A Preservation-Fluency Frontier for Gender Recoverability

Samyak Savi, Chavi Gupta, Shreyas Gantayet, Tanay Sodha, Dhruv Kumar

AI Summary

This paper investigates gender recoverability in English-to-Hindi translation, identifying a significant issue where translation systems erase gender cues through ergative and honorific constructions. To address this, they introduce two inference-time interventions: a Source-Aware Reranker (SAR) to avoid gender-neutralizing syntax and a Phenomenon-Aware Reranker (PAR) to preserve gender through targeted lexical marking. Experiments on a 37K instance benchmark show that PAR substantially improves gender preservation, but at the cost of reduced fluency, highlighting a trade-off between cultural fidelity and stylistic naturalness.

Key Contribution

English-to-Hindi translation often neuters gender cues, but a simple reranking trick can boost gender preservation by 70%—though fluency takes a hit.

Abstract

Generative translation systems are cultural technologies because they decide how socially meaningful cues are rendered within culturally specific grammatical systems. We study one concrete notion of successful cultural translation: when an English source explicitly encodes gender, an English-to-Hindi translation should preserve the recoverability of that cue unless the source itself is ambiguous. We evaluate this criterion on a 37,345-instance benchmark spanning twelve categories and show that five systems frequently erase gender through ergative and honorific constructions. We then introduce two mechanism-aware inference-time interventions. The first, the Source-Aware Reranker (SAR), prefers candidates that avoid gender-neutralizing syntax. The second, the Phenomenon-Aware Reranker (PAR), preserves gender through targeted lexical marking even when ergative syntax remains. Across GPT-4o-mini and Sarvam, PAR improves target-subset accuracy from 11.07% to 54.47% and from 15.99% to 49.66%, respectively. Human evaluation shows that PAR increases gender preservation from 10.3% to 81.3%, but reduces mean fluency from 4.36 to 3.37. These findings place the two interventions on a preservation and fluency frontier rather than supporting a single dominant solution, and show how culturally situated generation can require explicit tradeoffs among fidelity, fluency, and stylistic naturalness.

Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References23

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Cultural Fidelity in English-to-Hindi Translation: A Preservation-Fluency Frontier for Gender Recoverability

Related Papers