keywords:"Decision trees" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"Decision trees"

Search:



Search Tips :: Simple Search

Search collections:

Sort by:	Display results:	Output format:

	Implementation of Algorithms Based on Decision Trees in C# Grolig, Lukáš ; Pešek, Martin (referee) ; Stríž, Rostislav (advisor) This bachelor thesis is focused on selection of data mining algorithms based on decision trees for an analytical system developed under the project System for the Internet security increase based on malware spreading analysis. Selected algorithms are described in greater detais, as well as their implementation in the C# language. These algorithms are then tested with regards to their training speed and classification accuracy. Finally, this thesis presents further conclusions and recommendations based on performed experiments. Detailed record
	Utilization of artificial intelligence in technical diagnostics Konečný, Antonín ; Huzlík, Rostislav (referee) ; Zuth, Daniel (advisor) The diploma thesis is focused on the use of artificial intelligence methods for evaluating the fault condition of machinery. The evaluated data are from a vibrodiagnostic model for simulation of static and dynamic unbalances. The machine learning methods are applied, specifically supervised learning. The thesis describes the Spyder software environment, its alternatives, and the Python programming language, in which the scripts are written. It contains an overview with a description of the libraries (Scikit-learn, SciPy, Pandas ...) and methods — K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Decision Trees (DT) and Random Forests Classifiers (RF). The results of the classification are visualized in the confusion matrix for each method. The appendix includes written scripts for feature engineering, hyperparameter tuning, evaluation of learning success and classification with visualization of the result. Detailed record
	Comparison of Heuristic and Conventional Statistical Methods in Data Mining Bitara, Matúš ; Žák, Libor (referee) ; Bednář, Josef (advisor) The thesis deals with the comparison of conventional and heuristic methods in data mining used for binary classification. In the theoretical part, four different models are described. Model classification is demonstrated on simple examples. In the practical part, models are compared on real data. This part also consists of data cleaning, outliers removal, two different transformations and dimension reduction. In the last part methods used to quality testing of models are described. Detailed record
	Fundamental Analysis for Automatic Trading Systems Miček, Marek ; Kanich, Ondřej (referee) ; Rozman, Jaroslav (advisor) This thesis deals with the creation of automatic trading systems which are able to predict market trends for stocks selected in advance. Proper trading strategy of this system is mainly created from the elements of fundamental analysis, such as annual returns of company, it's gains, level of shareholder's equity or total debt. All the stocks are classified by these fundaments, where result of this classification determines whether to buy or sell the stock. For the purpose of this thesis, 5 autamatic trading systems were created in order to compare different approaches to the stock evaluation, managment or diversification of business portfolio. Created systems were properly tested on historical data and, in order to determine their level of complexity, tests were executed in both periods of economic recession and expansion too. All the created systems reported great returns and most of them have potential to generate long-term gains. On the basis of received results, it is possible to make conclusion that fundamental analysis has a high value in the field of automatic trading systems, and it increases the chances of generating a profit. Detailed record
	Data Mining Stehno, David ; Hynčica, Tomáš (referee) ; Honzík, Petr (advisor) The aim of the thesis was to study and describe data mining methodology CRISP-DM. From the collected database of calls to the call center a prediction was performed, based on CRISP-DM methodology. In phase of test situation modeling four different testing methods were used: the k-NN, neural network, linear regression and super vector machine. The input attributes importance for further prediction was evaluated based on different selections. The results and findings may provide data for further more accurate forecasts in the future; not only in number of calls but also other indicators relevant to the call center. Detailed record
	Advanced Data Mining in Cardiology Mézl, Martin ; Provazník, Ivo (referee) ; Sekora, Jiří (advisor) The aim of this master´s thesis is to analyse and search unusual dependencies in database of patients from Internal Cardiology Clinic Faculty Hospital Brno. The part of the work is theoretical overview of common data mining methods used in medicine, especially decision trees, naive Bayesian classifier, artificial neural networks and association rules. Looking for unusual dependencies between atributes is realized by association rules and naive Bayesian classifier. The output of this work is a complex system for Knowledge discovery in databases process for any data set. This work was realized with collaboration of Internal Cardiology Clinic Faculty Hospital Brno. All programs were made in Matlab 7.0.1. Detailed record
	Analysis of Operational Data and Detection od Anomalies during Supercomputer Job Execution Stehlík, Petr ; Nikl, Vojtěch (referee) ; Jaroš, Jiří (advisor) V posledních letech jsou superpočítače stále větší a složitější, s čímž souvisí problém využití plného potenciálu systému. Tento problém se umocňuje díky nedostatku nástrojů pro monitorování, které jsou specificky přizpůsobeny uživatelům těchto systémů. Cílem práce je vytvořit nástroj, nazvaný Examon Web, pro analýzu a vizualizaci provozních dat superpočítače a provést nad těmito daty hloubkovou analýzu pomocí neurálních sítí. Ty určí, zda daná úloha běžela korektně, či vykazovala známky podezřelého a nežádoucího chování jako je nezarovnaný přístup do operační paměti nebo např. nízké využití alokovaých zdrojů. O těchto faktech je uživatel informován pomocí GUI. Examon Web je postavený na frameworku Examon, který sbírá a procesuje metrická data ze superpočítače a následně je ukládá do databáze KairosDB. Implementace zahrnuje disciplíny od návrhu a implementace GUI, přes datovou analýzu, těžení dat a neurální sítě až po implementaci rozhraní na serverové straně. Examon Web je zaměřen zejména na uživatele, ale může být také využíván administrátory. GUI je vytvořeno ve frameworku Angular s knihovnami Dygraphs a Bootstrap. Uživatel díky tomu může analyzovat časové řady různých metrik své úlohy a stejně jako administrátor se může informovat o současném stavu superpočítače. Tento stav je zobrazen jako několik globálně agregovaných metrik v posledních 30 minutách nebo jako 3D model (či 2D model) superpočítače, který získává data ze samotných uzlů pomocí protokolu MQTT. Pro kontinuální získávání dat bylo využito rozhraní WebSocket s vlastním mechanismem přihlašování a odhlašování konkretních metrik zobrazovaných v modelu. Při analýze spuštěné úlohy má uživatel dostupné tři různé pohledy na danou úlohu. První nabízí celkový přehled o úloze a informuje o využitých zdrojích, času běhu a vytížení části superpočítače, kterou úloha využila společně s informací z neurálních sítí o podezřelosti úlohy. Další dva pohledy zobrazují metriky z výkonnostiního energetického hlediska. Pro naučení neurálních sítí bylo potřeba vytvořit novou datovou sadu ze superpočítače Galileo. Tato sada obsahuje přes 1100 úloh monitorovaných na tomto superpočítači z čehož 500 úloh bylo ručně anotováno a následně použito pro trénování sítí. Neurální sítě využívají model back-propagation, vhodný pro anotování časových sérií fixní délky. Celkem bylo vytvořeno 12 sítí pro metriky zahrnující vytížení procesoru, paměti a dalších části a např. také podíl celkového času procesoru v úsporném režimu C6. Tyto sítě jsou na sobě nezávislé a po experimentech jejich finální konfigurace 80-20-4-3-1 (80 vstupních až 1 výstupní neuron) podávaly nejlepší výsledky. Poslední síť (v konfiguraci 12-4-3-1) anotovala výsledky předešlých sítí. Celková úspěšnost systému klasifikace do 2 tříd je 84 %, což je na použitý model velmi dobré. Výstupem této práce jsou dva produkty. Prvním je uživatelské rozhraní a jeho serverová část Examon Web, která jakožto rozšiřující vrstva systému Examon pomůže s rozšířením daného systému mezi další uživatele či přímo další superpočítačová centra. Druhým výstupem je částečně anotovaná datová sada, která může pomoci dalším lidem v jejich výzkumu a je výsledkem spolupráce VUT, UNIBO a CINECA. Oba výstupy budou zveřejněny s otevřenými zdrojovými kódy. Examon Web byl prezentován na konferenci 1st Users' Conference v Ostravě pořádanou IT4Innovations. Další rozšíření práce může být anotace datové sady a také rozšíření Examon Web o rozhodovací stromy, které určí přesný důvod špatného chování dané úlohy. Detailed record
	Vertebra detection and identification in CT oncological data Věžníková, Romana ; Harabiš, Vratislav (referee) ; Jakubíček, Roman (advisor) Automated spine or vertebra detection and segmentation from CT images is a difficult task for several reasons. One of the reasons is unclear vertebra boundaries and indistinct boundaries between vertebra. Next reason is artifacts in images and high degree of anatomical complexity. This paper describes the design and implementation of vertebra detection and classification in CT images of cancer patients, which adds to the complexity because some of vertebrae are deformed. For the vertebra segmentation, the Otsu’s method is used. Vertebra detection is based on search of borders between individual vertebra in sagittal planes. Decision trees or the generalized Hough transform is applied for the identification whereas the vertebra searching is based on similarity between each vertebra model shape and planes of CT scans. Detailed record
	Project Management and Risks Visualization Support Tool Přibylová, Kateřina ; Trchalík, Roman (referee) ; Kreslíková, Jitka (advisor) This master thesis deals with the topic of risk management and explains its importance during project management in IT projects. It descibes every phase of the risk management life cycle and also methods and procedures used in each phase. After that the thesis focuses on decision analysis, mainly on decision trees and Monte Carlo simulation. The last part contains the design of an application for risk management and visualization. Prototype of this system has been implemented based on this design in PHP framework Laravel. Details of the implementation and testing are in the next part of this thesis. In the end there is an assessment with discussion of possible expansions. Detailed record
	Data Analysis of a Company Producing Medical Supplies Kulhánková, Monika ; Bartík, Vladimír (referee) ; Burgetová, Ivana (advisor) This bachelor's thesis deals with the analysis of the company's sales data, specifically the classification of the customer's type according to his sales data. It provides a theoretical introduction to data mining. It describes the classification process and methods for creating classifiers and presents the CRISP-DM model. This thesis describes the provided data sets, from which the relevant attributes are selected. The data are preprocessed and used in the creation and testing of classification models. The result of this thesis is a comparison of the achieved results. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English