keywords:"TF-IDF" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"TF-IDF"

Search:

Search Tips :: Advanced Search

Search collections:

Sort by:	Display results:	Output format:

	Detekce kategorie obsahu webové stránky prostřednictvím metod strojového učení. DOHNAL, Patrik This bachelor thesis is focused on design and the implementation of the algorithm for classifying the websites into a several categories. The implementation of this software is written in Python. For classifying purposes I use machine learning models such as Naive Bayes classifier, K-Nearest neighbors and Support Vector Machines. Within the process it is assumed to collect my own dataset, wich will be used for training and testing purposes. Thesis also includes detailed description of the methods I uesd. Detailed record
	Data Mining Methods for Text Analysis Kozák, Ondřej ; Marcoň, Petr (referee) ; Dohnal, Přemysl (advisor) This bachelor thesis explores the current methodology and possibilities of text mining and the subsequent application of some methods. The thesis described methods for preprocessing, methods for converting text to vector space and methods for text analysis and discusses their possible applications. The different preprocessing methods were applied to the text and then the conversion to vector space was demonstrated using simple methods such as BOW, Bag of n-grams, TF-IDF or with machine learning methods which are FastText and GloVe. LSA, LDA, TextRank and cosine similarity methods were applied to the extracted vectors to extract information from the text. Detailed record
	Searching relevant articles in extensive collections Vojt, Ján ; Novák, Jiří (advisor) ; Bartoš, Tomáš (referee) Searching text in articles is usually implemented with fulltext search. Using more advanced techniques however, it is possible to achieve significantly better results. The subject of this work is to create a universal library for searching extensible collections, specialized in czech language. The library makes use of tools capable of working with morphology while considering importance of words. It also conducts an experiment with word pairs, which adds context into the search process. The success rate of this experiment is tried on an extensible collection of data. Created library is a unique tool for processing extensible collections of czech text, while at the same time it is ready for further extension by new languages and methods. Detailed record
	Analysis of Mobile Devices Network Communication Data Abraham, Lukáš ; Bartík, Vladimír (referee) ; Burgetová, Ivana (advisor) At the beginning, the work describes DNS and SSL/TLS protocols, it mainly deals with communication between devices using these protocols. Then we'll talk about data preprocessing and data cleaning. Furthermore, the thesis deals with basic data mining techniques such as data classification, association rules, information retrieval, regression analysis and cluster analysis. The next chapter we can read something about how to identify mobile devices on the network. We will evaluate data sets that contain collected data from communication between the above mentioned protocols, which will be used in the practical part. After that, we finally get to the design of a system for analyzing network communication data. We will describe the libraries, which we used and the entire system implementation. We will perform a large number of experiments, which we will finally evaluate. Detailed record
	Pokročilý porovnávač produktov Prexta, Dávid This thesis deals with the problem of mining structured information concerning the features of the products from the open text, using open information extraction. These features will make it easier for customers to choose their product. In the beginning, it deals with existing solutions, their shortcomings and analysis of available systems for open information extraction. Furthermore, the theoretical background and technology used in the creation of the system, the design of the system itself and its implementation are discussed. At the end, the system testing, its results and extensions that could be implemented in the future are described. Detailed record
	DNS Data Analysis for Mobile Device Identification Purposes Sporni, Alex ; Bartík, Vladimír (referee) ; Burgetová, Ivana (advisor) This bachelor's thesis deals with the problem of identification of mobile devices based on DNS data analysis. The thesis provides a theoretical introduction to the computer communication model. This thesis explains the importance of DNS in the terms of network communication between devices, It also presents the provided data sets, which contain real communication of mobile devices. These data sets must be with a suitable technique parsed and stored in a database to provide better data manipulation techniques in the later stages of implementation. This work further describes individual techniques of data processing. It also depicts in detail the methodologies for evaluating the relevance of TF-IDF and the application of cosine similarity to identify the mobile devices. The main output of this work is the evaluation of the achieved results. Detailed record
	Comparison of Classification Methods Dočekal, Martin ; Zendulka, Jaroslav (referee) ; Burgetová, Ivana (advisor) This thesis deals with a comparison of classification methods. At first, these classification methods based on machine learning are described, then a classifier comparison system is designed and implemented. This thesis also describes some classification tasks and datasets on which the designed system will be tested. The evaluation of classification tasks is done according to standard metrics. In this thesis is presented design and implementation of a classifier that is based on the principle of evolutionary algorithms. Detailed record
	Advanced Machine-Learning Methods for Text Classification Dočekal, Martin ; Otrusina, Lubomír (referee) ; Smrž, Pavel (advisor) This thesis deals with advanced machine-learning methods for text classification. At first, these methods are described, and then text classification system is created based on these methods. The system also provides tools for document preprocessing and evaluation of classifier. The thesis describes the use of the system in a real-life task. Detailed record
	Searching relevant articles in extensive collections Vojt, Ján ; Novák, Jiří (advisor) ; Bartoš, Tomáš (referee) Searching text in articles is usually implemented with fulltext search. Using more advanced techniques however, it is possible to achieve significantly better results. The subject of this work is to create a universal library for searching extensible collections, specialized in czech language. The library makes use of tools capable of working with morphology while considering importance of words. It also conducts an experiment with word pairs, which adds context into the search process. The success rate of this experiment is tried on an extensible collection of data. Created library is a unique tool for processing extensible collections of czech text, while at the same time it is ready for further extension by new languages and methods. Detailed record
	Events and Places Agregation and Suggestions from Facebook Dubeň, Matej ; Plchot, Oldřich (referee) ; Szőke, Igor (advisor) The aim of this bachelor thesis is to explain the design and implementation of an Android application "Let's Go Out", which can recommend Facebook events and places to the user. The recommendation is carried out by using the hybrid recommending system approach that links together the collaborative filtering and a content-based recommendation approach, tracks the user's interaction with the application and, based on recorded data, adapts to the recommendation process. This thesis also describes the testing process that compares the recommender systems of competitive applications and points out achievements. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English