National Repository of Grey Literature 8 records found  Search took 0.00 seconds. 
Automatic post-editing of phrase-based machine translation outputs
Rosa, Rudolf ; Mareček, David (advisor) ; Žabokrtský, Zdeněk (referee)
We present Depfix, a system for automatic post-editing of phrase-based English-to-Czech machine trans- lation outputs, based on linguistic knowledge. First, we analyzed the types of errors that a typical machine translation system makes. Then, we created a set of rules and a statistical component that correct errors that are common or serious and can have a potential to be corrected by our approach. We use a range of natural language processing tools to provide us with analyses of the input sentences. Moreover, we reimple- mented the dependency parser and adapted it in several ways to parsing of statistical machine translation outputs. We performed both automatic and manual evaluations which confirmed that our system improves the quality of the translations.
Multi-source Text Summarization for Czech
Brus, Tomáš ; Bojar, Ondřej (advisor) ; Mareček, David (referee)
This work focuses on the summarization task for a set of articles on the same topic. It discusses several possible ways of summarizations and ways to assess their final quality. The implementation of the described algorithms and their application to selected texts constitutes a part of this work. The input texts come from several Czech news servers and they are represented as deep syntactic trees (the so called tectogrammatical layer).
Web Interface for the Treex Framework
Sedlák, Michal ; Popel, Martin (advisor) ; Rosa, Rudolf (referee)
This work deals with a web application called Treex::Web which serves as a web interface for NLP framework Treex. The work addresses several Treex issues (e.g. absence of graphical user interface and complicated installation) and offers Treex::Web as a possible solution. At the beginning of this work we introduce the Treex framework itself. The following chapters describe Treex::Web's user interface (chapter 3) and the implementation of the whole web application (chapter 4). Conclusion of this work includes a comparison of NLP frameworks similar to Treex and their web interfaces. 1
Automatic Error Correction of Machine Translation Output
Variš, Dušan ; Bojar, Ondřej (advisor) ; Mareček, David (referee)
We present MLFix, an automatic statistical post-editing system, which is a spiritual successor to the rule- based system, Depfix. The aim of this thesis was to investigate the possible approaches to automatic identification of the most common morphological errors produced by the state-of-the-art machine translation systems and to train sufficient statistical models built on the acquired knowledge. We performed both automatic and manual evaluation of the system and compared the results with Depfix. The system was mainly developed on the English-to- Czech machine translation output, however, the aim was to generalize the post-editing process so it can be applied to other language pairs. We modified the original pipeline to post-edit English-German machine translation output and performed additional evaluation of this modification. Powered by TCPDF (www.tcpdf.org)
Automatic Error Correction of Machine Translation Output
Variš, Dušan ; Bojar, Ondřej (advisor) ; Mareček, David (referee)
We present MLFix, an automatic statistical post-editing system, which is a spiritual successor to the rule- based system, Depfix. The aim of this thesis was to investigate the possible approaches to automatic identification of the most common morphological errors produced by the state-of-the-art machine translation systems and to train sufficient statistical models built on the acquired knowledge. We performed both automatic and manual evaluation of the system and compared the results with Depfix. The system was mainly developed on the English-to- Czech machine translation output, however, the aim was to generalize the post-editing process so it can be applied to other language pairs. We modified the original pipeline to post-edit English-German machine translation output and performed additional evaluation of this modification. Powered by TCPDF (www.tcpdf.org)
Automatic post-editing of phrase-based machine translation outputs
Rosa, Rudolf ; Mareček, David (advisor) ; Žabokrtský, Zdeněk (referee)
We present Depfix, a system for automatic post-editing of phrase-based English-to-Czech machine trans- lation outputs, based on linguistic knowledge. First, we analyzed the types of errors that a typical machine translation system makes. Then, we created a set of rules and a statistical component that correct errors that are common or serious and can have a potential to be corrected by our approach. We use a range of natural language processing tools to provide us with analyses of the input sentences. Moreover, we reimple- mented the dependency parser and adapted it in several ways to parsing of statistical machine translation outputs. We performed both automatic and manual evaluations which confirmed that our system improves the quality of the translations.
Web Interface for the Treex Framework
Sedlák, Michal ; Popel, Martin (advisor) ; Rosa, Rudolf (referee)
This work deals with a web application called Treex::Web which serves as a web interface for NLP framework Treex. The work addresses several Treex issues (e.g. absence of graphical user interface and complicated installation) and offers Treex::Web as a possible solution. At the beginning of this work we introduce the Treex framework itself. The following chapters describe Treex::Web's user interface (chapter 3) and the implementation of the whole web application (chapter 4). Conclusion of this work includes a comparison of NLP frameworks similar to Treex and their web interfaces. 1
Multi-source Text Summarization for Czech
Brus, Tomáš ; Bojar, Ondřej (advisor) ; Mareček, David (referee)
This work focuses on the summarization task for a set of articles on the same topic. It discusses several possible ways of summarizations and ways to assess their final quality. The implementation of the described algorithms and their application to selected texts constitutes a part of this work. The input texts come from several Czech news servers and they are represented as deep syntactic trees (the so called tectogrammatical layer).

Interested in being notified about new results for this query?
Subscribe to the RSS feed.