National Repository of Grey Literature 53 records found  beginprevious41 - 50next  jump to record: Search took 0.01 seconds. 
Recognition of Poses and Gestures
Jiřík, Leoš ; Hradiš, Michal (referee) ; Zemčík, Pavel (advisor)
This thesis inquires the existing methods on the field of image recognition with regards to gesture recognition. Some methods have been chosen for deeper study and these are to be discussed later on. The second part goes in for the concenpt of an algorithm that would be able of robust gesture recognition based on data acquired within the AMI and M4 projects. A new ways to achieve precise information on participants position are suggested along with dynamic data processing approaches toward recognition. As an alternative, recognition using Gaussian Mixture Models and periodicity analysis are brought in. The gesture class in focus are speech supporting gestures. The last part demonstrates the results and discusses future work.
Multiplatform Application for Speaker Verification
Görig, Jan ; Matějka, Pavel (referee) ; Glembek, Ondřej (advisor)
Bachelor thesis considers speaker recognition without knowledge of spoken message. There are described current feature extraction methods and their evaluation using Gaussian mixture model. The practical output of this work is application for visualization of the recognition process. Developed application is cross platform and it uses Qt and BSAPI libraries.
Speech Recognition For Selected Languages
Schmitt, Jan ; Karafiát, Martin (referee) ; Janda, Miloš (advisor)
This bachelor's thesis deals with recognition of continues speech for three languages - Bulgarian, Croatian and Swedish. There are described basics of speech processing and recognition methods like acoustic modeling using hidden Markov models and gaussian mixture models. Another aim of this work is preparing data for those languages from GlobalPhone database, so they may be used with speech recognition toolkits Kaldi and HTK. With data prepared there are several models trained and tested using Kaldi toolkit.
Acoustic signal classification
Pospíšil, Aleš ; Balík, Miroslav (referee) ; Atassi, Hicham (advisor)
Bachelor's thesis is focused on automatic music genre classication. First part of work evaluates present situation and refer to published studies. Gained knowledge from there is applied in this work. In terms of nding solution for problem the work summarize and describe suitable music features and classication techniques like neural networks and k-nearest neighbor. Four selected classication classes were classical, electro, jazz and rock music. Result of work is user-friendly system that provides automatic music genre recognition. Achieved classication performance is more less comparable to human music genres recognition.
Retinal blood vessel segmentation in fundus images via statistical-based methods
Šolc, Radek ; Walek, Petr (referee) ; Odstrčilík, Jan (advisor)
This diploma thesis deals with segmentation of blood vessel from images acquired by fundus camera. The characteristic of fundus images and current methods of segmentation are described in theoretical part. The reach of the practical part is method using statistical model. The model using Student´s distribution for automatic segmentation is gradually drafted. Firstly EM- algorithm has been incorporated and model drafted on Markov random fields for improving robustness to noise after that. Contrast of thin blood vessel is improved in image preprocessing part by discrete wave transformation. The output image is used as mask for grayscale intensity decrease of thinnest blood-vessel and intensity increase of background. Whole model was programed in Matlab. The model was tested on whole HRF database. The quantitative evaluation of binary images were compared with golden standard images.
Emotional States of Humans and their Determination using Speech Record Analysis
Lněnička, Jakub ; Míča, Ivan (referee) ; Smékal, Zdeněk (advisor)
The aim of the diploma project is to find a method through which it will be possibleto classify the selected emotion from speech. At the beginning of the work deals with the description of the human body and their voice-generating operation. Furthermore, the text deals with the problem of the human voice into digital form.Great attention is paid to the parameters of the speech signal with an emphasis on describing the symptoms to help the selected emotion. The work deals with therecognition of emotions and a description of some of them. The main part is finding the best methods to reduce symptoms of segmental and suprasegmental speech utterances. The results of success was achieved by comparing the classification of selected emotions when using multiple methods and compare their results. The most important criterion in assessing the results ofthe reduction parameters of the speech signal, based on previous research in this area.
Paralinguistic signals recognition in spoken dialogs
Mašek, Jan ; Míča, Ivan (referee) ; Atassi, Hicham (advisor)
This document describes the three methods for the detection and classification of paralinguistic expressions such as laughing and crying from usual speech by analysis of the audio signal. The database of records was originally designed for this purpose. When analyzing everyday dialogs, music might be included, so the database was extended by four new classes as speech, music, singing with music and usual speech with background music. Feature extraction, feature reduction and classification are common steps in recognizing for all three methods. Difference of the methods is given by classification process in detail. One classification of all six classes at once is proposed in the first method called straight approach. In the second method called decision tree oriented approach we are using five intuitive sub classifiers in the tree structure and the final method uses for classification emotion coupling approach. The best features were reduced by feature evaluation using F-ratio and GMM classifiers were used for the each classification part.
Emotional State Recognition Based on Speech Signal Analysis
Čermák, Jan ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The thesis is focused on the emotional states classification in the Matlab program, using neural networks and the classifier which is based on a combination of Gaussian density functions. It deals with the speech signal processing; the prosodic and spectral signs and the MFCC coefficients were extracted from the signal. The work also deals with the quality evaluation of individual signs of which the most suitable were chosen in order to provide the correct classification of emotional states. In order to identify the emotional states, two different methods were used. The first method of classification was the use of neural networks with differently selected parameters, and the second method was the use of the Gaussian mixture model (GMM). In both methods, a database of emotional utterances was divided into the training group and the test group. The testing was based on a method independent of the speaker. The work also includes the comparison of individual analyzed methods as well as the representation and comparison of the results. The conclusion comprises a proposition for the best parameters and the best classifier for the recognition of the speaker’s emotional state.
Automatic vocal-oriented recognition of human emotions
Houdek, Miroslav ; Přinosil, Jiří (referee) ; Atassi, Hicham (advisor)
This master thesis concerns with emotional states and gender recognition on the basis of speech signal analysis. We used various prosodic and cepstral features for the description of the speech signal. In the text we describe non-invasive methods for glottal pulses estimation. The described features of speech were implemented in MATLAB. For their classification we used the GMM classifier, which uses the Gaussian probability distribution for modeling a feature space. Furthermore, we constructed a system for recognition of emotional states of the speaker and a system for gender recognition from speech. We tested the success of created systems with several features on speech signal segments of various lengths and compared the results. In the last part we tested the influence of speaker and gender on the success of emotional states recognition.
Neural networks in speaker classification
Svoboda, Libor ; Atassi, Hicham (referee) ; Míča, Ivan (advisor)
The content of this work is focused on the neural network per speaker recognition. The work deals with problems of processing speech signal and there are introduction some types of neural network. The part of work was made database of records from speakers with have various sex and ages. The train and test group was made from the database. For classifier were suggested afterwards. One of them was nominated on base Gaussian mixture model and three of them were nominated on neural. This system was tested and analyzed on the basis of age, gender and both criterions each other at the end. Attention is focused on choice suitable feature in each mission of classification at the same time. At the end of work are introduced results of analysis for individual groups and features. The most suitable features are diagnosed from given mission of classification and the most prosperous classifier.

National Repository of Grey Literature : 53 records found   beginprevious41 - 50next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.