Search papers, labs, and topics across Lattice.
This paper introduces annotation guidelines for cross-speaker syntactic dependencies in spoken language treebanks using the Universal Dependencies framework, covering phenomena like collaborative constructions and question-answer pairs. It proposes both speaker-based and dependency-based representations to capture these inter-turn relationships. The guidelines also refine distinctions between reformulations and repairs, and promote elements within unfinished phrases to improve annotation consistency.
Finally, a way to represent the messy, collaborative syntax of real spoken language in treebanks.
The paper proposes annotation guidelines for syntactic dependencies that span across speaker turns - including collaborative coconstructions proper, wh-question answers, and backchannels - in spoken language treebanks within the Universal Dependencies framework. Two representations are proposed: a speaker-based representation following the segmentation into speech turns, and a dependency-based representation with dependencies across speech turns. New propositions are also put forward to distinguish between reformulations and repairs, and to promote elements in unfinished phrases.