National Repository of Grey Literature 2 records found  Search took 0.01 seconds. 
Statistical methods in stylometry
Dupal, Pavel ; Kaspříková, Nikola (advisor) ; Šulc, Zdeněk (referee)
The aim of this thesis is to provide an overview of some of the commonly used methods in the area of authorship attribution (stylometry). The text begins with a recap of history from the end of the 19th century to present time and the required terminology from the field of text mining is presented and explained. What follows is a list of selected methods from the field of multidimensional statistics (principal components analysis, cluster analysis) and machine learning (Support Vector Machines, Naive Bayes) and their application as pertains to stylometrical problems, including several methods created specifically for use in this field (bootstrap consensus tree, contrast analysis). Finally these same methods are applied to a practical problem of authorship verification based on a corpus bulit from the works of four internet writers.
A comparison of frequentist and Bayesian approaches to probability
Dupal, Pavel ; Karel, Tomáš (advisor) ; Bílková, Diana (referee)
The aim of this thesis is to provide a basic comparison between the classical (frequentist) and Bayesian schools of statistics on both the historical and the scientific level. The history of statistics is examined in short starting from the 17th century through the recent years. Key figures and discoveries are mentioned, as well as some of the occasions when the frequentist and Bayesian approaches clashed or influenced one another. Different interpretations of probability are listed and the Bayesian approach is introduced via Bayes' Theorem. The frequentist and Bayesian methods for point estimation, interval estimation and hypothesis testing are then described and compared. Every method is then further presented in an example and the results are evaluated using selected criteria.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.