Search papers, labs, and topics across Lattice.
This paper introduces a novel parsing algorithm and syntactic rule set, drawing on CFG and GPSG, to generate both dependency and constituency parse trees while handling noisy input. The approach aims to bridge the gap between theoretical syntactic work and computational applications, resulting in a more interpretable NLP model. Experiments on Universal Dependencies data demonstrate a UAS of 54.5% on the development set and 53.8% on the test set, with the system providing multiple parse hypotheses for potential reranking.
Reviving syntactic rules offers a transparent and interpretable alternative to opaque neural parsing models, even if current accuracy lags behind state-of-the-art.
This research introduces a new parsing approach, based on earlier syntactic work on context free grammar (CFG) and generalized phrase structure grammar (GPSG). The approach comprises both a new parsing algorithm and a set of syntactic rules and features that overcome the limitations of CFG. It also generates both dependency and constituency parse trees, while accommodating noise and incomplete parses. The system was tested on data from Universal Dependencies, showing a promising average Unlabeled Attachment Score (UAS) of 54.5% in the development dataset (7 corpora) and 53.8% in the test set (12 corpora). The system also provides multiple parse hypotheses, allowing further reranking to improve parsing accuracy. This approach also leverages much of the theoretical syntactic work since the 1950s to be used within a computational context. The application of this approach provides a transparent and interpretable NLP model to process language input.