National Repository of Grey Literature 111 records found  beginprevious99 - 108next  jump to record: Search took 0.00 seconds. 
Robustness Aspects of Knowledge Discovery
Kalina, Jan
The sensitivity of common knowledge discovery methods to the presence of outlying measurements in the observed data is discussed as their major drawback. Our work is devoted to robust methods for information extraction from data. First, we discuss neural networks for function approximation and their sensitivity to the presence of noise and outlying measurements in the data. We propose to fit neural networks in a robust way by means of a robust nonlinear regression. Secondly, we consider information extraction from categorical data, which commonly suffers from measurement errors. To improve its robustness properties, we propose a regularized version of the common test statistics, which may find applications e.g. in pattern discovery from categorical data.
Robust Knowledge Discovery from High-Dimensional Data
Kalina, Jan
The paper is devoted to advanced robust methods for information extraction from highdimensional data. The concept of knowledge discovery is discussed together with its two important aspects: high dimensionality of the data and sensitivity to the presence of outlying data values. We propose new robust methods for knowledge discovery suitable for highdimensional data. They are based on the idea of implicit weighting, which is inspired by the least weighted squares regression estimator. We propose a highly robust method for a dimension reduction, which can be described as a robust alternative of the principal component analysis based on implicit down-weighting of less reliable data values. Further, we propose a novel robust approach to cluster analysis, which is a popular knowledge discovery method of unsupervised learning. A two-stage cluster analysis method tailor-made for highdimensional data is obtained by combining the robust principal component analysis with the robust cluster analysis. The procedure can be interpreted as a robust knowledge discovery method tailor made for high-dimensional data.
Nonlinear Trend Modeling in the Analysis of Categorical Data
Kalina, Jan
This paper studies various approaches to testing trend in the context of categorical data. While the linear trend is far more popular in econometric applications, a nonlinear modeling of the trend allows a more subtle information extraction from real data, especially if the linearity of the trend cannot be expected and verified by hypothesis testing. We exploit the exact unconditional approach to propose alternative versions of some trend tests. One of them is the test of relaxed trend (Liu, 1998), who proposed a generalization of the classical Cochran- Armitage test of linear trend. A numerical example on real data reveals the advantages of the test of relaxed trend compared to the classical test of linear trend. Further, we propose an exact unconditional test also for modeling association between an ordinal response and nominal regressor. Further, we propose a robust estimator of parameters in the logistic regression model, which is based on implicit weighting of individual observations. We assess the breakdown point of the newly proposed robust estimator.
Logistic and Poisson regression
Kalina, Jan
The work describes methods of logistic and Poisson regression for the analysis of contingency tables. It explains general principles of hypothesis testing about the significance of their parameters. Particular methods can be derived also within a general theory of generalized linear models. For this general context also principles of variable selection are investigated. It is shown how properties of particular methods correspond to properties, which can be derived in the framework of generalized linear models.
Analysis of contingency tables
Kalina, Jan
The paper compares asymptotic, exact conditional and exact unconditional tests for the evaluation of contingency tables. Relationship of asymptotic methods with generalized linear models is studied. The work focuses on exact methods, which yield promising results in the analysis of data with small samples. Some of the methods are alternatively derived in the context of logistic and Poisson regression.
On Heteroscedasticity in Robust Regression
Kalina, Jan
This work studies the phenomenon of heteroscedasticity and its consequences for various methods of linear regression, including the least squares, least weighted squares and regression quantiles. We focus on hypothesis tests for these regression methods. The new approach consists in deriving asymptotic heteroscedasticity tests for robust regression, which are asymptotically equivalent to standard tests computed for the least squares regression. One approach to modeling heteroscedasticity assumes a prior knowledge or specific model for the variability of random regression errors. Another (and more general) approach does not assume a specific form of heteroscedasticity. The paper also describes heteroscedastic regression, which is a tool to incorporate heteroscedasticity to the model. This allows us to define the heteroscedastic least weighted squares regression.

National Repository of Grey Literature : 111 records found   beginprevious99 - 108next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.