Mar 16, 2026 | arXiv:2603.14755

Learning Constituent Headedness

Zeyao Qi, Yige Chen, KyungTae Lim, Haihua Pan, Jungyeul Park

AI Summary

This paper introduces a method for explicitly learning constituent headedness in constituency trees by treating it as a supervised prediction task. The authors induce supervision from aligned constituency and dependency annotations, defining the head of each constituent as the dependency span head. Experiments on English and Chinese data show that the learned models achieve high accuracy, outperform rule-based percolation methods, and improve constituency-to-dependency conversion.
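The idea of a dependency span head can be illustrated with a minimal sketch. It assumes one common reading of the term: the head of a span is the single token inside the span whose dependency head lies outside it (or is the root). The function name `span_head`, the 1-based indexing, and the toy sentences are illustrative assumptions, not the authors' implementation.

```python
from typing import List, Optional


def span_head(heads: List[int], start: int, end: int) -> Optional[int]:
    """Return the span head of tokens start..end (1-based, inclusive).

    heads[i - 1] is the 1-based index of token i's dependency head,
    with 0 marking the root. Assumption for this sketch: a span has a
    well-defined head only if exactly one of its tokens attaches outside
    the span; otherwise None is returned.
    """
    external = [
        i for i in range(start, end + 1)
        if not (start <= heads[i - 1] <= end)
    ]
    return external[0] if len(external) == 1 else None


# Toy example: "the cat sat" with the -> cat, cat -> sat, sat -> ROOT.
toy_heads = [2, 3, 0]
np_head = span_head(toy_heads, 1, 2)   # 2 ("cat" heads the NP "the cat")
s_head = span_head(toy_heads, 1, 3)    # 3 ("sat" heads the whole sentence)

# Toy example: "She saw him yesterday", all dependents attach to "saw".
flat_heads = [2, 0, 2, 2]
no_head = span_head(flat_heads, 3, 4)  # None: two tokens attach outside
```

Spans for which the function returns `None` do not form a single dependency subtree under this definition; how such cases are treated in the paper is not stated in this summary.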

Key Contribution

Learning constituent headedness as a supervised task achieves near-ceiling intrinsic accuracy and improves the fidelity of deterministic constituency-to-dependency conversion compared to Collins-style rule-based percolation.

Abstract

Headedness is widely used as an organizing device in syntactic analysis, yet constituency treebanks rarely encode it explicitly and most processing pipelines recover it procedurally via percolation rules. We treat this notion of constituent headedness as an explicit representational layer and learn it as a supervised prediction task over aligned constituency and dependency annotations, inducing supervision by defining each constituent head as the dependency span head. On aligned English and Chinese data, the resulting models achieve near-ceiling intrinsic accuracy and substantially outperform Collins-style rule-based percolation. Predicted heads yield comparable parsing accuracy under head-driven binarization, consistent with the induced binary training targets being largely equivalent across head choices, while increasing the fidelity of deterministic constituency-to-dependency conversion and transferring across resources and languages under simple label-mapping interfaces.
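To make the deterministic constituency-to-dependency conversion mentioned above concrete, here is a hedged sketch of the standard head-percolation procedure: once every constituent has a designated head child, each non-head child's lexical head attaches to the lexical head of the head child. The tree encoding (nested tuples with an explicit head-child index) and the function names are assumptions for illustration only; the paper's actual data format and conversion code are not described on this page.

```python
from typing import Dict, Tuple, Union

# Hypothetical encoding: a leaf is a 1-based token index; an internal node
# is (label, index of the head child within children, list of children).
Tree = Union[int, Tuple[str, int, list]]


def _percolate(tree: Tree, heads: Dict[int, int]) -> int:
    """Fill `heads` for the subtree and return its lexical head token."""
    if isinstance(tree, int):
        return tree
    _label, head_idx, children = tree
    lexical = [_percolate(child, heads) for child in children]
    head_token = lexical[head_idx]
    for i, token in enumerate(lexical):
        if i != head_idx:
            # Non-head children depend on the head child's lexical head.
            heads[token] = head_token
    return head_token


def to_dependencies(tree: Tree) -> Dict[int, int]:
    """Map each token index to its head index (0 marks the root)."""
    heads: Dict[int, int] = {}
    root = _percolate(tree, heads)
    heads[root] = 0
    return heads


# "the cat sat": (S (NP the cat) (VP sat)), with NP headed by "cat"
# and S headed by its VP child.
toy_tree = ("S", 1, [("NP", 1, [1, 2]), ("VP", 0, [3])])
toy_deps = to_dependencies(toy_tree)  # {1: 2, 2: 3, 3: 0}
```

In this sketch the head-child index could come either from percolation rules or from a learned head predictor; the conversion step itself is identical, so conversion fidelity depends entirely on the quality of the head choices.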

Natural Language Processing

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
