Original title: Konzistence lingvistických anotací
Translated title: Consistency of Linguistic Annotation
Authors: Aggarwal, Akshay ; Zeman, Daniel (advisor) ; Lopatková, Markéta (referee)
Document type: Master’s theses
Year: 2020
Language: eng
Abstract: Thesis Abstract Akshay Aggarwal July 2020 This thesis attempts at correction of some errors and inconsistencies in dif- ferent treebanks. The inconsistencies can be related to linguistic constructions, failure of the guidelines of annotation, failure to understand the guidelines on annotator's part, or random errors caused by annotators, among others. We propose a metric to attest the POS annotation consistency of different tree- banks in the same language, when the annotation guidelines remain the same. We offer solutions to some previously identified inconsistencies in the scope of the Universal Dependencies Project, and check the viability of a proposed in- consistency detection tool in a low-resource setting. The solutions discussed in the thesis are language-neutral, intended to work with multiple languages with efficiency. 1
Keywords: Annotation Consistency; Annotation Inconsistency; Error Mining; Language Independent; Morphology; Syntax; UD Project; Universal Dependencies; dobývání chyb; jazykově nezávislé; konzistence anotace; morfologie; nekonzistence anotace; projekt UD; syntax; Universal Dependencies

Institution: Charles University Faculties (theses) (web)
Document availability information: Available in the Charles University Digital Repository.
Original record: http://hdl.handle.net/20.500.11956/120867

Permalink: http://www.nusl.cz/ntk/nusl-434818


The record appears in these collections:
Universities and colleges > Public universities > Charles University > Charles University Faculties (theses)
Academic theses (ETDs) > Master’s theses
 Record created 2021-02-24, last modified 2022-03-04


No fulltext
  • Export as DC, NUŠL, RIS
  • Share