National Repository of Grey Literature 5 records found  Search took 0.00 seconds. 
An Implementation of Methods of Structural Analysis of Czech Complex Sentences
Dutkevič, Jiří ; Kuboň, Vladislav (advisor) ; Holan, Tomáš (referee)
Title: An Implementation of Methods of Structural Analysis of Czech Complex Sentences Author: Jiří Dutkevič Department: Institute of Formal and Applied Linguistics Supervisor: doc. RNDr. Vladislav Kuboň, Ph.D., Institute of Formal and Applied Linguistics Abstract: This paper discusses automated analysis of complex sentences in Czech language. It summarizes the results of preceding research, uses therein described method for splitting complex sentences into segments using well defined set of separators and proposes three methods of automated assignment of levels to segments (which also describe relations between the segments) in sentences based on rules presented in the research. First method directly applies the rules presented in referenced research papers, the second method uses a genetic algorithm and the third makes use of a neural network. This paper includes an implementation of these methods and an analysis of the results using manually annotated data from the Prague Dependency Treebank.
Detection of Intensity in Sentiment Analysis of Czech
Dargaj, Jakub ; Tamchyna, Aleš (advisor) ; Mareček, David (referee)
Sentiment analysis is concerned with automatic extraction of subjective information from text. The goal of this thesis is to predict the intensity of attitude in Czech texts. In order to solve this task, we prepared a dataset of movie reviews by users of Czech-Slovak Film Database. We compare several machine learning methods, focusing on feature extraction from text data. Using convolutional neural networks and corpus-dependent training of word embeddings, we surpassed basic models and achieved accuracy similar to the most recent results in this field. We also analyze the logistic regression model in order to compare the vocabulary used in reviews with different ratings.
Native Language Identification of L2 Speakers of Czech
Tydlitátová, Ludmila ; Hana, Jiří (advisor) ; Vidová Hladká, Barbora (referee)
Native Language Identification is the task of identifying an author's na- tive language based on their productions in a second language. The absolute majority of previous work has focused on English as the second language. In this thesis, we work with 3,715 essays written in Czech by non-native speakers. We use machine learning methods to determine whether an au- thors native language belongs to the Slavic language group. By training models with different feature and parameter settings, we were able to reach an accuracy of 78%. 1
Morphological Analyser of Old English
Tichý, Ondřej ; Čermák, Jan (advisor) ; Petkevič, Vladimír (referee) ; Kučera, Karel (referee)
The paper describes the construction and testing of an electronic application for automatic morphological analysis of Old English. It introduces resources and methodologies at our disposal based on the state of the art in the field of electronic analysis of Old English and on an overview of Old English morphology. A detailed account of the chosen methodology is offered and a specific description of the implementation is provided: from the acquisition and preparation of the input data and choice of technology to the programming and testing of the results. The resulting recall of 95% can be seen as a success of the project, however, the paper also shows how the recall may be improved. It also discusses further use of the analyser, especially the disambiguation of its results. The paper makes a future semi-automatic morphological tagging of Old English texts a real possibility. Powered by TCPDF (www.tcpdf.org)
An Implementation of Methods of Structural Analysis of Czech Complex Sentences
Dutkevič, Jiří ; Kuboň, Vladislav (advisor) ; Holan, Tomáš (referee)
Title: An Implementation of Methods of Structural Analysis of Czech Complex Sentences Author: Jiří Dutkevič Department: Institute of Formal and Applied Linguistics Supervisor: doc. RNDr. Vladislav Kuboň, Ph.D., Institute of Formal and Applied Linguistics Abstract: This paper discusses automated analysis of complex sentences in Czech language. It summarizes the results of preceding research, uses therein described method for splitting complex sentences into segments using well defined set of separators and proposes three methods of automated assignment of levels to segments (which also describe relations between the segments) in sentences based on rules presented in the research. First method directly applies the rules presented in referenced research papers, the second method uses a genetic algorithm and the third makes use of a neural network. This paper includes an implementation of these methods and an analysis of the results using manually annotated data from the Prague Dependency Treebank.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.