This paper investigates whether linguistically similar pivot languages and few-shot demonstrations can improve LLM translation in extremely low-resource settings without fine-tuning. The authors combine pivot-based prompting with in-context learning and evaluate translation performance under controlled conditions. Results show that pivot-based prompting can yield modest improvements, especially for less well-represented languages, but the gains are sensitive to how the few-shot examples are constructed and diminish for closely related or better-represented languages.
Forget expensive fine-tuning: linguistically-informed prompting offers a lightweight, but sometimes unreliable, path to low-resource translation with LLMs.
Large Language Models (LLMs) have achieved strong performance across many downstream tasks, yet their effectiveness in extremely low-resource machine translation remains limited. Standard adaptation techniques typically rely on large-scale parallel data or extensive fine-tuning, which are infeasible for the long tail of underrepresented languages. In this work, we investigate a more constrained question: in data-scarce settings, to what extent can linguistically similar pivot languages and few-shot demonstrations provide useful guidance for on-the-fly adaptation in LLMs? We study a data-efficient experimental setup that combines linguistically related pivot languages with few-shot in-context examples, without any parameter updates, and evaluate translation behavior under controlled conditions. Our analysis shows that while pivot-based prompting can yield improvements in certain configurations, particularly in settings where the target language is less well represented in the model's vocabulary, the gains are often modest and sensitive to few-shot example construction. For closely related or better-represented varieties, we observe diminishing or inconsistent gains. Our findings provide empirical guidance on how and when inference-time prompting and pivot-based examples can be used as a lightweight alternative to fine-tuning in low-resource translation settings.
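To make the setup in the abstract concrete, the following is a minimal sketch of how a pivot-augmented few-shot prompt might be assembled at inference time, with no parameter updates. It is an illustration only: the abstract does not give the prompt format, so the `Demo` class, the `build_pivot_prompt` function, the template wording, and the placeholder language names are all assumptions rather than the authors' method.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Demo:
    """One few-shot demonstration (hypothetical structure, not from the paper)."""
    source: str  # sentence in the source language
    pivot: str   # the same sentence in the related pivot language
    target: str  # reference translation in the low-resource target language


def build_pivot_prompt(
    demos: List[Demo],
    query: str,
    src_lang: str,
    pivot_lang: str,
    tgt_lang: str,
    query_pivot: Optional[str] = None,
) -> str:
    """Assemble a few-shot prompt that pairs each example with a pivot rendering.

    The template wording is an assumption made for illustration.
    """
    lines = [
        f"Translate from {src_lang} to {tgt_lang}. "
        f"{pivot_lang} is closely related to {tgt_lang}; use it as a hint."
    ]
    for demo in demos:
        lines.append("")  # blank line between examples
        lines.append(f"{src_lang}: {demo.source}")
        lines.append(f"{pivot_lang}: {demo.pivot}")
        lines.append(f"{tgt_lang}: {demo.target}")
    lines.append("")
    lines.append(f"{src_lang}: {query}")
    if query_pivot is not None:
        # Optionally supply a pivot rendering of the query itself.
        lines.append(f"{pivot_lang}: {query_pivot}")
    lines.append(f"{tgt_lang}:")  # left open for the model to complete
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder strings stand in for real parallel sentences.
    demos = [
        Demo("<source sentence 1>", "<pivot sentence 1>", "<target sentence 1>"),
        Demo("<source sentence 2>", "<pivot sentence 2>", "<target sentence 2>"),
    ]
    print(build_pivot_prompt(demos, "<query sentence>", "English", "PivotLang", "TargetLang"))
```

Because the abstract reports that gains are sensitive to few-shot example construction, the choice and ordering of `demos` is the part of such a sketch most likely to matter in practice; the resulting string would be passed unchanged to whichever LLM is being evaluated.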