National Repository of Grey Literature 4 records found  Search took 0.00 seconds. 
Automatic Resolution of Pronoun Coreference in Czech
Košarko, Ondřej ; Mírovský, Jiří (advisor) ; Vidová Hladká, Barbora (referee)
Title: Automatic Resolution of Pronoun Coreference in Czech Author: Ondřej Košarko Department: ÚFAL MFF UK Supervisor: RNDr. Jiří Mírovský, Ph.D. Supervisor's e­mail address: mirovsky@ufal.mff.cuni.cz Abstract: The aim of this thesis is to introduce a procedure for automatic pronomial coreference resolution in Czech texts. The text is morphologically and analytically annotated acording to the system of Prague Dependency Treebank. The procedure uses a machine learning method; for its training a set of manually annotated data from Prague Dependency Treebank is used. Evaluation of the results is also part of this thesis. Keywords: pronomial coreference, automatic resolution, machine learning
CLARIN-DSpace repository at LINDAT/CLARIN : LINDAT/CLARIN FAIR repository for language data
Straňák, Pavel ; Košarko, Ondřej ; Mišutka, Jozef
We will present a software solution for and experience in running a digital repository for language data and natural language processing tools - LINDAT/CLARIN. We will present unique support for licensing with an emphasis on Open Access, and how we support all 4 key FAIR principles. We will show the submission workflow including license choice, approval and publishing or submissions by editors, as well as the repository administration environment including license definition, signing and access control. We will also present repository integration with other services, and statistics of operation.
Fulltext: Stranak_Kosarko_Misutka_fulltext - Download fulltextPDF
Slides: Stranak_prezentace_EN - Download fulltextPDF
Video: Stranak_video - Download fulltextMP4
Continously Learning Analyser of Audio-Visual Recordings
Košarko, Ondřej ; Peterek, Nino (advisor) ; Klusáček, David (referee)
This thesis introduces a tool for analysis of audiovisual records. The tool uses the audio and closed captions supplied by the user to prepare text annotation. The annotation contains a transcript of the show which is based on the closed captions. In addition, speaker diarization is performed to mark who spoke when. The diarization is performed by a third party library. The library is evaluated on data from DIALOG corpus. The inner workings of the library are described. To assign the right portions of the text to the right section of the record Kaldi, a speech recognition toolkit, is used. Furthermore the thesis contains an overview describing how closed captions are created; overview of speech corpora creation; and a brief review of literature on record analysis. 1
Automatic Resolution of Pronoun Coreference in Czech
Košarko, Ondřej ; Mírovský, Jiří (advisor) ; Vidová Hladká, Barbora (referee)
Title: Automatic Resolution of Pronoun Coreference in Czech Author: Ondřej Košarko Department: ÚFAL MFF UK Supervisor: RNDr. Jiří Mírovský, Ph.D. Supervisor's e­mail address: mirovsky@ufal.mff.cuni.cz Abstract: The aim of this thesis is to introduce a procedure for automatic pronomial coreference resolution in Czech texts. The text is morphologically and analytically annotated acording to the system of Prague Dependency Treebank. The procedure uses a machine learning method; for its training a set of manually annotated data from Prague Dependency Treebank is used. Evaluation of the results is also part of this thesis. Keywords: pronomial coreference, automatic resolution, machine learning

Interested in being notified about new results for this query?
Subscribe to the RSS feed.