National Repository of Grey Literature 83 records found  beginprevious51 - 60nextend  jump to record: Search took 0.01 seconds. 
Semantic Enrichment Component
Doležal, Jan ; Otrusina, Lubomír (referee) ; Dytrych, Jaroslav (advisor)
This master's thesis describes Semantic Enrichment Component (SEC), that searches entities (e.g., persons or places) in the input text document and returns information about them. The goals of this component are to create a single interface for named entity recognition tools, to enable parallel document processing, to save memory while using the knowledge base, and to speed up access to its content. To achieve these goals, the output of the named entity recognition tools in the text was specified, the tool for storing the preprocessed knowledge base into the shared memory was implemented, and the client-server scheme was used to create the component.
Annotation Editor for Semantic Analysis of Text
Šťastná, Barbora ; Otrusina, Lubomír (referee) ; Dytrych, Jaroslav (advisor)
The thesis introduces the most crucial terms related with text annotation, and shows some electronic tools for annotation of electronic documents. It describes an annotation editor which provides the user with a graphic interface and allows for annotation of web documents. It also proposes modifications to the editor which would make it more intuitive, efficient, and user-friendly. The thesis follows with the description of implementation of the said modifications, and their testing.
Analysis of architecture and superstructure possibilities of framework JUnit
Kadeřábek, Martin ; Pecinovský, Rudolf (advisor) ; Pavlíčková, Jarmila (referee)
JUnit is one of the most popular test framework in Java programming language. Its new version is going to come into reality under the name JUnit 5 which introduces new concept and possibilities for writing unit tests. The main subject of my bachelor's thesis is analysis of concept, architecture and superstructure possibilities of developed framework JUnit 5. Thereto is used of interpretation of news launched into the market, which is replenished by demonstration examples. The bachelor's thesis describes and explains new architecture and new possibilities of framework JUnit 5. The first part of the bachelor's thesis describes per unit testing. It's contextualized in software testing and the framework JUnit is classified among other testing frameworks. The second part of the bachelor's thesis deals with new version of framework JUnit and analyses its architecture and its possibilities for writing tests. At the end the recommendation for education purposes is given.
Discourse Relations in Czech
Poláková, Lucie ; Hajičová, Eva (advisor) ; Hoffmannová, Jana (referee) ; Pešek, Ondřej (referee)
This doctoral thesis is devoted to linguistic analysis of discourse relations as one of the aspects of discourse coherence. Discourse relations are semantic relations holding between propositions in a discourse (discourse arguments). The aim of the thesis is a complex description of discourse relations in Czech and its application in an annotation scheme in the Prague Dependency Treebank. The thesis is divided into three parts: The first one is focused on the theoretical description of discourse relations and on analysis of adequacy of various methodological concepts in corpus processing. The second part describes in detail the proposed scheme for the annotation of discourse relations and the process of the corpus build- up including the evaluation of consistency of the annotated data. Finally, in the last part of the thesis, we address some problematic issues arisen with the employment the proposed scheme and look for their possible solutions.
Assessing the impact of manual corrections in the Groningen Meaning Bank
Weck, Benno ; Lopatková, Markéta (advisor) ; Vidová Hladká, Barbora (referee)
The Groningen Meaning Bank (GMB) project develops a corpus with rich syntactic and semantic annotations. Annotations in GMB are generated semi-automatically and stem from two sources: (i) Initial annotations from a set of standard NLP tools, (ii) Corrections/refinements by human annotators. For example, on the part-of-speech level of annotation there are currently 18,000 of those corrections, so called Bits of Wisdom (BOWs). For applying this information to boost the NLP processing we experiment how to use the BOWs in retraining the part-of-speech tagger and found that it can be improved to correct up to 70% of identified errors within held-out data. Moreover an improved tagger helps to raise the performance of the parser. Preferring sentences with a high rate of verified tags in retraining has proven to be the most reliable way. With a simulated active learning experiment using Query-by-Uncertainty (QBU) and Query-by- Committee (QBC) we proved that selectively sampling sentences for retraining yields better results with less data needed than random selection. In an additional pilot study we found that a standard maximum-entropy part-of-speech tagger can be augmented so that it uses already known tags to enhance its tagging decisions on an entire sequence without retraining a new model first. Powered by...
Measures of Machine Translation Quality
Macháček, Matouš ; Bojar, Ondřej (advisor) ; Kuboň, Vladislav (referee)
Title: Measures of Machine Translation Quality Author: Matouš Macháček Department: Institute of Formal and Applied Linguistics Supervisor: RNDr. Ondřej Bojar, Ph.D. Abstract: We explore both manual and automatic methods of machine trans- lation evaluation. We propose a manual evaluation method in which anno- tators rank only translations of short segments instead of whole sentences. This results in easier and more efficient annotation. We have conducted an annotation experiment and evaluated a set of MT systems using this method. The obtained results are very close to the official WMT14 evaluation results. We also use the collected database of annotations to automatically evalu- ate new, unseen systems and to tune parameters of a statistical machine translation system. The evaluation of unseen systems, however, does not work and we analyze the reasons. To explore the automatic methods, we organized Metrics Shared Task held during the Workshop of Statistical Ma- chine Translation in years 2013 and 2014. We report the results of the last shared task, discuss various metaevaluation methods and analyze some of the participating metrics. Keywords: machine translation, evaluation, automatic metrics, annotation
The role of annotations of fiction works (the use of annotations in public libraries)
Camfrla, Štěpán ; Buřilová, Marcela (advisor) ; Hubrtová, Jitka (referee)
This thesis concerns the role of annotations of fiction works, and it describes the significance of annotation, its functions, as well as its actual use in public libraries. The thesis shows the clash between theoretical expectations and practical results. In theory, annotation is presented as an autonomous literary text the purpose of which is to present the reader with accurate and appropriate information about the books available in a library's Online Public Access Catalog. In practise, however, libraries have been trying to provide information at all costs, shifting away from creative methods of annotating. The closing chapter gives a glimpse of possible future development of the issue.
Detekce podezřelých anotací
Václ, Jan ; Vidová Hladká, Barbora (advisor) ; Hana, Jiří (referee)
This work describes a machine learning approach for checking the part-of-speech annotation, and presents its implementation - a system called MissTagger. The checking procedure covers both error detection and error correction. MissTagger employs a simplified instance-based learning algorithm where the words in the text are recognized as instances. Part-of-speech tags of context of static length are selected as features, no lexical information is included. The words whose tags comprises this context are chosen based either on a linear or on a dependency-tree structure of the sentence. Two languages are examined in the experiments for evaluation, Czech and English.
Mobile Application for Image Annotation
Hřebíček, Pavel ; Čadík, Martin (referee) ; Brejcha, Jan (advisor)
This thesis deals with a topic of creation of cross-platform mobile application for image annotation. The theoretical part of my thesis describes the specification of the application both in terms of design and functionality. The practical part discusses the actual implementation of the application from the technologies used through the application architecture to implementation of the tools and functions themselves. For the practical part of this thesis was used the Qt framework, which is based on the C++ language. It was also necessary to become familiar with the issues of UX Design and Usability testing.

National Repository of Grey Literature : 83 records found   beginprevious51 - 60nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.