Search papers, labs, and topics across Lattice.
This paper investigates the impact of dependency-induced headedness on punctuation-aware tree binarization for constituency parsing. The authors find that while learned heads outperform rule-based heads in head prediction, they do not consistently improve parsing performance after debinarization, particularly in punctuation-sensitive metrics. This suggests that linguistically grounded headedness may not be optimal for parser supervision via binarization.
Despite superior head prediction accuracy, learned heads fail to consistently improve constituency parsing, especially when evaluated on punctuation, challenging the assumption that better headedness directly translates to better parsing.
We revisit punctuation-aware tree binarization for constituency parsing and ask whether dependency-induced headedness improves binary parser supervision. Although learned heads substantially outperform rule-based heads in intrinsic head prediction, they do not yield consistent parsing gains after debinarization. In particular, punctuation-conditioned evaluation shows that learned headedness underperforms rule-based binarization in macro-average punctuation-sensitive $F_1$, despite a small overall gain on CTB. Similar instability appears under cross-treebank transfer. These results suggest that \ycc{linguistically grounded} headedness is not necessarily parser-optimal when used as a binarization control signal. The paper presents a negative result: better head prediction does not imply better punctuation-sensitive constituency parsing.