keywords:"TF-IDF" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"TF-IDF"

Search:

Search Tips :: Advanced Search

Search collections:

Sort by:	Display results:	Output format:

	Events and Places Agregation and Suggestions from Facebook Dubeň, Matej ; Plchot, Oldřich (referee) ; Szőke, Igor (advisor) The aim of this bachelor thesis is to explain the design and implementation of an Android application "Let's Go Out", which can recommend Facebook events and places to the user. The recommendation is carried out by using the hybrid recommending system approach that links together the collaborative filtering and a content-based recommendation approach, tracks the user's interaction with the application and, based on recorded data, adapts to the recommendation process. This thesis also describes the testing process that compares the recommender systems of competitive applications and points out achievements. Detailed record
	Classification Framework Koroncziová, Dominika ; Otrusina, Lubomír (referee) ; Kouřil, Jan (advisor) The goal of this work is the design and implementation of a machine learning software, based on the RapidMiner library. The finished application integrates the most commonly used algorithms and processes implemented in RapidMiner into an easily usable program. The application contains a simple command line interface, as well as a graphic interface to simplify selection of multiple parameters. The program also provides a tool to create standalone programs, that can be used for classification with a pre-trained model. On top of the original requirements the possibility to work with textual data from Wikipedia was also implemented, providing a tool for downloading and preprocessing of the data in order to use them as training input. This text focuses on the specifics of the algorithms and classifiers used and on their features and uses, and describes the design and implementation of the system. As part of this work, several tests were run in order to validate the efficiency and functionality of the program. The test results are included at the end of the thesis. Detailed record
	DNS Data Analysis for Mobile Device Identification Purposes Sporni, Alex ; Bartík, Vladimír (referee) ; Burgetová, Ivana (advisor) This bachelor's thesis deals with the problem of identification of mobile devices based on DNS data analysis. The thesis provides a theoretical introduction to the computer communication model. This thesis explains the importance of DNS in the terms of network communication between devices, It also presents the provided data sets, which contain real communication of mobile devices. These data sets must be with a suitable technique parsed and stored in a database to provide better data manipulation techniques in the later stages of implementation. This work further describes individual techniques of data processing. It also depicts in detail the methodologies for evaluating the relevance of TF-IDF and the application of cosine similarity to identify the mobile devices. The main output of this work is the evaluation of the achieved results. Detailed record
	Analysis of Mobile Devices Network Communication Data Abraham, Lukáš ; Bartík, Vladimír (referee) ; Burgetová, Ivana (advisor) At the beginning, the work describes DNS and SSL/TLS protocols, it mainly deals with communication between devices using these protocols. Then we'll talk about data preprocessing and data cleaning. Furthermore, the thesis deals with basic data mining techniques such as data classification, association rules, information retrieval, regression analysis and cluster analysis. The next chapter we can read something about how to identify mobile devices on the network. We will evaluate data sets that contain collected data from communication between the above mentioned protocols, which will be used in the practical part. After that, we finally get to the design of a system for analyzing network communication data. We will describe the libraries, which we used and the entire system implementation. We will perform a large number of experiments, which we will finally evaluate. Detailed record
	Methods of Web Page Classification Nachtnebl, Viktor ; Burget, Radek (referee) ; Bartík, Vladimír (advisor) This work deals with methods of web page classification. It explains the concept of classification and different features of web pages used for their classification. Further it analyses representation of a page and in detail describes classification method that deals with hierarchical category model and is able to dynamically create new categories. In the second half it shows implementation of chosen method and describes the results. Detailed record
	Semantic Similarity of Articles Veselovský, Martin ; Otrusina, Lubomír (referee) ; Kouřil, Jan (advisor) This bachelor's thesis deals with modelling of structure of semantic relationships among articles in English language. There are introduced existing methods of articles representation and computation of similarity. The base method is vector space model, which represents document as vector of words. There are given weights of importance to these words using TF-IDF method. Next, there are described advanced methods of modelling, Latent semantic analysis (LSA) and Latent Dirichlet allocation (LDA). This thesis also deals with articles, which are semantically annotated, while weights of annotation words are computed by Stochastic Gradient Descent method. Evaluation of results takes place on the prepared test corpus of documents to which there is reference similarity evaluation. Detailed record
	Derivation of Dictionary for Process Inspector Tool on SharePoint Platform Pavlín, Václav ; Masařík, Karel (referee) ; Kreslíková, Jitka (advisor) This master's thesis presents methods for mining important pieces of information from text. It analyses the problem of terms extraction from large document collection and describes the implementation using C# language and Microsoft SQL Server. The system uses stemming and a number of statistical methods for term extraction. This project also compares used methods and suggests the process of the dictionary derivation. Detailed record
	Improved Prediction of Social Tags Using Data Mining Harár, Pavol ; Galáž, Zoltán (referee) ; Kříž, Jiří (advisor) This master’s thesis deals with using Text mining as a method to predict tags of articles. It describes the iterative way of handling big data files, parsing the data, cleaning the data and scoring of terms in article using TF-IDF. It describes in detail the flow of program written in programming language Python 3.4.3. The result of processing more than 1 million articles from Wikipedia database is a dictionary of English terms. By using this dictionary one is capable of determining the most important terms from article in corpus of articles. Relevancy of consequent tags proves the method used in this case. Detailed record
	Actual Events Tracker Odstrčilík, Martin ; Otrusina, Lubomír (referee) ; Kouřil, Jan (advisor) The goal of the master thesis project was to develop an application for tracking of actual events in the surrounding area of the users. This application should allow the users to view events, create new events and add comments to existing ones. Beyond the implementation of developed application, this project deals with an analysis of the presented problem. The analysis includes a comparison with existing solutions and search for available technologies and frameworks applicable for implementation. Another part inside this work is description of the theory in behind of data classification that is internally used for event and comment analysis. This work also includes a design of appliction including design of user interface, software architecture, database, communication protocol and data classifiers. The main part of this project, the implementation, is described aftewards. At the end of this work, there is a summary of the whole process and also there are given some ideas about enhancing the application in the future. Detailed record
	Binární klasifikace zákaznických incidentů pomocí metod NLP Pokorný, Jiří This bachelor thesis focuses on building a model for binary classification of customer incidents within the SAP system. By classifying the individual sentences of incidents, the final category of the incident is predicted. The used text is in English. To compare traditional and modern approaches to text classification as well as obtain optimal results, a series of experiments is carried out using different methods of balancing the dataset, vector representation and classification. Finally, the results are analyzed and recommendation is formulated with regard to further development, including applying knowledge gained within the SAP environment. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English