National Repository of Grey Literature 117 records found  beginprevious108 - 117  jump to record: Search took 0.01 seconds. 
Quality measures of classification models and their conversion
Hanusek, Lubomír ; Hebák, Petr (advisor) ; Řezanková, Hana (referee) ; Skalská, Hana (referee)
Predictive power of classification models can be evaluated by various measures. The most popular measures in data mining (DM) are Gini coefficient, Kolmogorov-Smirnov statistic and lift. These measures are each based on a completely different way of calculation. If an analyst is used to one of these measures it can be difficult for him to asses the predictive power of a model evaluated by another measure. The aim of this thesis is to develop a method how to convert one performance measure into another. Even though this thesis focuses mainly on the above-mentioned measures, it deals also with other measures like sensitivity, specificity, total accuracy and area under ROC curve. During development of DM models you may need to work with a sample that is stratified by values of the target variable Y instead of working with the whole population containing millions of observations. If you evaluate a model developed on a stratified data you may need to convert these measures to the whole population. This thesis describes a way, how to carry out this conversion. A software application (CPM) enabling all these conversions makes part of this thesis. With this application you can not only convert one performance measure to another, but you can also convert measures calculated on a stratified sample to the whole population. Besides the above mentioned performance measures (sensitivity, specificity, total accuracy, Gini coefficient, Kolmogorov-Smirnov statistic), CPM will also generate confusion matrix and performance charts (lift chart, gains chart, ROC chart and KS chart). This thesis comprises the user manual to this application as well as the web address where the application can be downloaded. The theory described in this thesis was verified on the real data.
Financial Balance of Households
Siegelová, Klára ; Bartošová, Jitka (advisor) ; Bína, Vladislav (referee)
The main task of this thesis deals with the financial balance of Czech households. The thesis monitors changes in the development of the debt burden of households, but it also pursue the development of their economies. The chosen period is the period from approximately 1997 to 2007 before the financial crisis and the period during the crisis in 2010. The aim of this paper is statistical analysis of data file Household budget survey in 2009. It focuses on the financial balance of households based on the difference between revenue and expenditure of households.
Modeling development of incurred value of claim
Kantorová, Petra ; Zimmermann, Pavel (advisor) ; Hrevuš, Jan (referee)
This diploma project is focused on the estimation of incurred value of claim and probability of the claim remaining opened (not settled) in the specific stage of the insurance settlement process. The change of incurred value of claim means the change of settlement process stage. Generalized linear model is used for modelling these changes. Classical linear regression model also belongs into this theory, which is its special case, just with stricter premises. Generalized linear model among others allows solving the problem of heteroscedasticity in the unusual way using joint model. This model is applied in the practical part of this piece of work. Logistic regression is the part of the generalized linear model theory, which helps to model the probability of the claim remaining opened in this piece of work. The model outcome is presented in graphic way, especially the graphs containing probability that levels of given claim will occur in certain range.
Scoring Models in Finance (Skóringové modely ve financích)
Rychnovský, Michal ; Zouhar, Jan (advisor) ; Kalčevová, Jana (referee)
The aim of the present work is to describe the application of the logistic regression model to the field of probability of default modeling, and provide a brief introduction to the scoring development process used in financial practice. We start by introducing the theoretical background of the logistic regression model; followed by a consequent derivation of three most common scoring models. Then we present a formal definition of the Gini coefficient as a diversification power measure and derive the Somers-type formulas for its estimation. Finally, the key part of this work gives an overview of the whole scoring development process illustrated on the examples of real business data.
Comparison of approaches to creating credit scoring models
Hofman, Elena ; Šedivý, Jan (advisor)
This work is focused on the management of a credit risk related to the traditional bank lending business to individuals. The paper deals with a theory of measuring risk with help of PD (Probability of Default) parameter when different scoring models are used. The goal is to outline an issue with the credit risk and its management in general, attention is paid to details of a process of creating scoring models. There are three specific modeling techniques listed, namely logistic regression, decision trees and neural networks. Methods are explained in detail and are given possibilities of mutual comparison. The application part is devoted to the evaluation and comparison of credit scoring models based on these methods.
Comparison of the potency of application KDD methods and statistical methods in the analysis of ADAMEK data
Líbal, Petr ; Rauch, Jan (advisor) ; Berka, Petr (referee)
This bachelor thesis compares association rules and logistic regression. For this comparison medical data Adamek have been used. The relationship between attributes belonging to a group of Physical examinations and Difficulty has been studied. Both methods are theoretically described, their connection with the related common areas is mentioned - the analysis of market basket in the case of association rules, linear regression in the case of logistic regression. Before the analysis attributes are described with basic statistics and the distribution of values is graphically illustrated. In both cases, analysis proceed the same way. First, the relationship of each difficulty is examined, then is examined relationship of difficulties in general. In conclusion, the results of both methods is compared.
The Application of the Discrete Choice Models
Čejková, Tereza ; Hušek, Roman (advisor) ; Fíglová, Zuzana (referee)
This thesis treats with the theory, interpretation and application of the Discrete Choice Models. The theoretical part contains the Fitting the Logistic Regression Model, Testing for the Significance of the Coefficients, Testing for the Significance of the Model. The Multiple Logistic Regression is mentioned too. The model was applied to interview data from the International research called Reflex.
Aplikace metod vícerozměrné analýzy v sociologických průzkumech
Kopáčková, Jitka ; Fialová, Hana (advisor) ; Průšová, Petra (referee)
Cílem diplomové práce je použít statistické metody vícerozměrné analýzy k hlubšímu poznání preferencí vysokoškolsky vzdělaných zaměstnanců. Především jde o nalezení zásadních podnětů, které mohou ovlivňovat spokojenost vysokoškolsky vzdělaného zaměstnance v jeho zaměstnání. K dosažení tohoto cíle je třeba provést výběr vhodných statistických metod a postupů a úspěšně je aplikovat.

National Repository of Grey Literature : 117 records found   beginprevious108 - 117  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.