Search papers, labs, and topics across Lattice.
This paper introduces a novel neural machine translation (NMT) decoder that generates target-language dependency trees in a top-down, left-to-right manner. The motivation is to improve translation of long and rare input sequences, where standard encoder-decoder NMT models often struggle. Experiments demonstrate that this top-down string-to-tree decoding approach generalizes better than sequence-to-sequence decoding, particularly for long, unseen inputs.
Syntactic NMT decoders don't have to be bottom-up: a top-down tree generation strategy can drastically improve translation of long, rare sequences.
Most of modern neural machine translation (NMT) models are based on an encoder-decoder framework with an attention mechanism. While they perform well on standard datasets, they can have trouble in translation of long inputs that are rare or unseen during training. Incorporating target syntax is one approach to dealing with such length-related problems. We propose a novel syntactic decoder that generates a target-language dependency tree in a top-down, left-to-right order. Experiments show that the proposed top-down string-to-tree decoding generalizes better than conventional sequence-to-sequence decoding in translating long inputs that are not observed in the training data.