National Repository of Grey Literature 14 records found  1 - 10next  jump to record: Search took 0.01 seconds. 
Data Mining Methods for Text Analysis
Kozák, Ondřej ; Marcoň, Petr (referee) ; Dohnal, Přemysl (advisor)
This bachelor thesis explores the current methodology and possibilities of text mining and the subsequent application of some methods. The thesis described methods for preprocessing, methods for converting text to vector space and methods for text analysis and discusses their possible applications. The different preprocessing methods were applied to the text and then the conversion to vector space was demonstrated using simple methods such as BOW, Bag of n-grams, TF-IDF or with machine learning methods which are FastText and GloVe. LSA, LDA, TextRank and cosine similarity methods were applied to the extracted vectors to extract information from the text.
Web Application of Recommender System
Koníček, Igor ; Bartík, Vladimír (referee) ; Zendulka, Jaroslav (advisor)
This master's thesis describes creation of recommender system that is used in real server cbdb.cz. A~fully operational recommender system was developed using collaborative and content-based filtering techniques. Thanks to many user feedback, we were able to evaluate their opinion. Many recommended books were tagged as desirable. This thesis is extending current functionality of cbdb.cz with recommender system. This system uses its extensive database of ratings, users and books.
Quality Analysis of Electronic Dictionaries Transformation
Stehlíková, Petra ; Škoda, Petr (referee) ; Kouřil, Jan (advisor)
The bachelor's thesis deals with electronic dictionaries, their formats and quality analysis of their conversions. The thesis describes Lexical Markup Framework format in detail. It also discusses the capabilities of advanced algorithms such as LSA for conversion quality analysis and the tools that can be used for the analysis. Based on this theoretical knowledge the scripts in Python language were created to analyze dictionaries in Lexical Markup Framework format.
Algorithms for anomaly detection in data from clinical trials and health registries
Bondarenko, Maxim ; Blaha, Milan (referee) ; Schwarz, Daniel (advisor)
This master's thesis deals with the problems of anomalies detection in data from clinical trials and medical registries. The purpose of this work is to perform literary research about quality of data in clinical trials and to design a personal algorithm for detection of anomalous records based on machine learning methods in real clinical data from current or completed clinical trials or medical registries. In the practical part is described the implemented algorithm of detection, consists of several parts: import of data from information system, preprocessing and transformation of imported data records with variables of different data types into numerical vectors, using well known statistical methods for detection outliers and evaluation of the quality and accuracy of the algorithm. The result of creating the algorithm is vector of parameters containing anomalies, which has to make the work of data manager easier. This algorithm is designed for extension the palette of information system functions (CLADE-IS) on automatic monitoring the quality of data by detecting anomalous records.
Automatic Testing of JavaScript Restrictor Project
Bednář, Martin ; Pluskal, Jan (referee) ; Polčák, Libor (advisor)
The aim of the thesis was to design, implement and evaluate the results of automatic tests for the JavaScript Restrictor project, which is being developed as a web browser extension. The tests are divided into three levels - unit, integration, and system. The Unit Tests verify the behavior of individual features, the Integration Tests verify the correct wrapping of browser API endpoints, and the System Tests check that the extension does not suppress the desired functionality of web pages. The System Tests are implemented for parallel execution in a distributed environment which has succeeded in achieving an almost directly proportional reduction in time with respect to the number of the tested nodes. The benefit of this work is detection of previously unknown errors in the JavaScript Restrictor extension and provision of the necessary information that allowed to fix some of the detected bugs.
Algorithms for anomaly detection in data from clinical trials and health registries
Bondarenko, Maxim ; Blaha, Milan (referee) ; Schwarz, Daniel (advisor)
This master's thesis deals with the problems of anomalies detection in data from clinical trials and medical registries. The purpose of this work is to perform literary research about quality of data in clinical trials and to design a personal algorithm for detection of anomalous records based on machine learning methods in real clinical data from current or completed clinical trials or medical registries. In the practical part is described the implemented algorithm of detection, consists of several parts: import of data from information system, preprocessing and transformation of imported data records with variables of different data types into numerical vectors, using well known statistical methods for detection outliers and evaluation of the quality and accuracy of the algorithm. The result of creating the algorithm is vector of parameters containing anomalies, which has to make the work of data manager easier. This algorithm is designed for extension the palette of information system functions (CLADE-IS) on automatic monitoring the quality of data by detecting anomalous records.
Web Application of Recommender System
Hlaváček, Pavel ; Bartík, Vladimír (referee) ; Zendulka, Jaroslav (advisor)
This thesis deals with problems of recommender systems and their usage in web applications. There are three main data mining techniques summarized and individual approaches for recommendation. Main part of this thesis is a suggestion and an implementation of web applications for recommending dishes from restaurants. Algorithm for food recommending is designed and implemented in this paper. The algorithm deals with the problem of frequently changing items. The algorithm utilizes hybrid filtering technique which is based on content and knowledge. This filtering technique uses cosine vector similarity for computation.
Data Mining Methods for Text Analysis
Kozák, Ondřej ; Marcoň, Petr (referee) ; Dohnal, Přemysl (advisor)
This bachelor thesis explores the current methodology and possibilities of text mining and the subsequent application of some methods. The thesis described methods for preprocessing, methods for converting text to vector space and methods for text analysis and discusses their possible applications. The different preprocessing methods were applied to the text and then the conversion to vector space was demonstrated using simple methods such as BOW, Bag of n-grams, TF-IDF or with machine learning methods which are FastText and GloVe. LSA, LDA, TextRank and cosine similarity methods were applied to the extracted vectors to extract information from the text.
Automatic Testing of JavaScript Restrictor Project
Bednář, Martin ; Pluskal, Jan (referee) ; Polčák, Libor (advisor)
The aim of the thesis was to design, implement and evaluate the results of automatic tests for the JavaScript Restrictor project, which is being developed as a web browser extension. The tests are divided into three levels - unit, integration, and system. The Unit Tests verify the behavior of individual features, the Integration Tests verify the correct wrapping of browser API endpoints, and the System Tests check that the extension does not suppress the desired functionality of web pages. The System Tests are implemented for parallel execution in a distributed environment which has succeeded in achieving an almost directly proportional reduction in time with respect to the number of the tested nodes. The benefit of this work is detection of previously unknown errors in the JavaScript Restrictor extension and provision of the necessary information that allowed to fix some of the detected bugs.
Quality Analysis of Electronic Dictionaries Transformation
Stehlíková, Petra ; Škoda, Petr (referee) ; Kouřil, Jan (advisor)
The bachelor's thesis deals with electronic dictionaries, their formats and quality analysis of their conversions. The thesis describes Lexical Markup Framework format in detail. It also discusses the capabilities of advanced algorithms such as LSA for conversion quality analysis and the tools that can be used for the analysis. Based on this theoretical knowledge the scripts in Python language were created to analyze dictionaries in Lexical Markup Framework format.

National Repository of Grey Literature : 14 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.