Mar 30, 2026 | arXiv:2603.27938

Top-down string-to-dependency Neural Machine Translation

Shuhei Kondo, Katsuhito Sudoh, Yuji Matsumoto

AI Summary

This paper introduces a novel neural machine translation (NMT) decoder that generates target-language dependency trees in a top-down, left-to-right manner. The motivation is to improve translation of long and rare input sequences, where standard encoder-decoder NMT models often struggle. Experiments demonstrate that this top-down string-to-tree decoding approach generalizes better than sequence-to-sequence decoding, particularly for long, unseen inputs.

Key Contribution

A syntactic NMT decoder can generate the target-language dependency tree top-down and left-to-right, and this decoding generalizes better than conventional sequence-to-sequence decoding on long inputs not observed in training.

Abstract

Most modern neural machine translation (NMT) models are based on an encoder-decoder framework with an attention mechanism. While they perform well on standard datasets, they can have trouble translating long inputs that are rare or unseen during training. Incorporating target syntax is one approach to dealing with such length-related problems. We propose a novel syntactic decoder that generates a target-language dependency tree in a top-down, left-to-right order. Experiments show that the proposed top-down string-to-tree decoding generalizes better than conventional sequence-to-sequence decoding in translating long inputs that are not observed in the training data.
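
To make the phrase "top-down, left-to-right order" concrete, here is a minimal sketch of one possible reading: the root word is emitted first, then each word's dependents are visited from left to right, depth-first. The abstract does not specify the exact traversal, so the depth-first choice, the `top_down_order` function, the head-index encoding, and the example sentence are all illustrative assumptions rather than the authors' method.

```python
from collections import defaultdict


def top_down_order(heads):
    """Return token indices in a top-down, left-to-right order.

    heads[i] is the index of token i's head, or -1 if token i is the root.
    Assumption: depth-first pre-order, dependents visited in surface order.
    """
    children = defaultdict(list)
    root = None
    for idx, head in enumerate(heads):
        if head == -1:
            root = idx
        else:
            # Tokens are scanned in surface order, so each child list
            # is already sorted left to right.
            children[head].append(idx)
    if root is None:
        raise ValueError("dependency tree has no root")

    order = []
    stack = [root]
    while stack:
        node = stack.pop()
        order.append(node)
        # Push the right-most dependent first so the left-most is popped next.
        stack.extend(reversed(children[node]))
    return order


# Hypothetical example: "She reads long books", with "reads" as the root.
tokens = ["She", "reads", "long", "books"]
heads = [1, -1, 3, 1]

order = top_down_order(heads)
generated = [tokens[i] for i in order]
print(generated)
```

Under this reading the head of every word is produced before the word itself, so a decoder always conditions on a word's ancestors. The surface sentence can be recovered afterwards by sorting the generated words by their original positions.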

Architecture Design (Transformers, SSMs, MoE) | Natural Language Processing

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
