Original title:
Konzistence lingvistických anotací
Translated title:
Consistency of Linguistic Annotation
Authors:
Aggarwal, Akshay ; Zeman, Daniel (advisor) ; Lopatková, Markéta (referee) Document type: Master’s theses
Year:
2020
Language:
eng Abstract:
Thesis Abstract Akshay Aggarwal July 2020 This thesis attempts at correction of some errors and inconsistencies in dif- ferent treebanks. The inconsistencies can be related to linguistic constructions, failure of the guidelines of annotation, failure to understand the guidelines on annotator's part, or random errors caused by annotators, among others. We propose a metric to attest the POS annotation consistency of different tree- banks in the same language, when the annotation guidelines remain the same. We offer solutions to some previously identified inconsistencies in the scope of the Universal Dependencies Project, and check the viability of a proposed in- consistency detection tool in a low-resource setting. The solutions discussed in the thesis are language-neutral, intended to work with multiple languages with efficiency. 1
Keywords:
Annotation Consistency; Annotation Inconsistency; Error Mining; Language Independent; Morphology; Syntax; UD Project; Universal Dependencies; dobývání chyb; jazykově nezávislé; konzistence anotace; morfologie; nekonzistence anotace; projekt UD; syntax; Universal Dependencies
Institution: Charles University Faculties (theses)
(web)
Document availability information: Available in the Charles University Digital Repository. Original record: http://hdl.handle.net/20.500.11956/120867