National Repository of Grey Literature 3 records found  Search took 0.01 seconds. 
Preprocessing and Transformation of Text Data Collections
Maruna, Viktor ; Burget, Radek (referee) ; Bartík, Vladimír (advisor)
This bachelor thesis deals with the issue of text-mining, mostly focused on preprocessing and transformation. In theoretical part there are contained information about development and principles of text-mining processes, text data collections and use in practice. The next part of this thesis describes in detail single steps of preprocessing and transformation of text data collections. In the final parts there are reviews of application development, testing and personal view on this thesis.
Preprocessing and Transformation of Text Data Collections
Maruna, Viktor ; Burget, Radek (referee) ; Bartík, Vladimír (advisor)
This bachelor thesis deals with the issue of text-mining, mostly focused on preprocessing and transformation. In theoretical part there are contained information about development and principles of text-mining processes, text data collections and use in practice. The next part of this thesis describes in detail single steps of preprocessing and transformation of text data collections. In the final parts there are reviews of application development, testing and personal view on this thesis.
Hledání sémantické informace v textových datech s využitím latentní analýzy
Řezníček, Pavel
The first part of thesis focuses on theoretical introduction to the methods of text mining -- Information retrieval, classification and clustering. LSA method is presented as an advanced model for representing textual data. Furthermore, the work describes source data and methods for their preprocessing and preparation used to enhance the effectiveness of text mining methods. For each chosen text mining method there are defined evaluation metrics and used already existing, or newly implemented, programs are presented. The results of experiments comparing the effects of different preprocessing type and use of different models of the source data are then demonstrated and discussed in the conclusion.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.