National Repository of Grey Literature 88 records found  beginprevious79 - 88  jump to record: Search took 0.01 seconds. 
Emotional State Recognition Based on Speech Signal Analysis
Čermák, Jan ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The thesis is focused on the emotional states classification in the Matlab program, using neural networks and the classifier which is based on a combination of Gaussian density functions. It deals with the speech signal processing; the prosodic and spectral signs and the MFCC coefficients were extracted from the signal. The work also deals with the quality evaluation of individual signs of which the most suitable were chosen in order to provide the correct classification of emotional states. In order to identify the emotional states, two different methods were used. The first method of classification was the use of neural networks with differently selected parameters, and the second method was the use of the Gaussian mixture model (GMM). In both methods, a database of emotional utterances was divided into the training group and the test group. The testing was based on a method independent of the speaker. The work also includes the comparison of individual analyzed methods as well as the representation and comparison of the results. The conclusion comprises a proposition for the best parameters and the best classifier for the recognition of the speaker’s emotional state.
Automatic Recognition of Logopaedic Defect in Speech Utterances
Dušil, Lubomír ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The thesis is aimed at an analysis and automatic detection of logopaedic defects in speech utterance. Its objective is to facilitate and accelerate the work of logopaedists and to increase percentage of detected logopaedic defects in children of the youngest possible age followed by the most successful treatment. It presents methods of speech work, classification of the defects within individual stages of child development and appropriate words for identification of the speech defects and their subsequent remedy. After that there are analyses of methods of calculating coefficients which reflect human speech best. Also classifiers which are used to discern and determine whether it is a speech defect or not. Classifiers exploit coefficients for their work. Coefficients and classifiers are being tested and their best combination is being looked for in order to achieve the highest possible success rate of the automatic detection of the speech defects. All the programming and testing jobs has been conducted in the Matlab programme.
Automatic vocal-oriented recognition of human emotions
Houdek, Miroslav ; Přinosil, Jiří (referee) ; Atassi, Hicham (advisor)
This master thesis concerns with emotional states and gender recognition on the basis of speech signal analysis. We used various prosodic and cepstral features for the description of the speech signal. In the text we describe non-invasive methods for glottal pulses estimation. The described features of speech were implemented in MATLAB. For their classification we used the GMM classifier, which uses the Gaussian probability distribution for modeling a feature space. Furthermore, we constructed a system for recognition of emotional states of the speaker and a system for gender recognition from speech. We tested the success of created systems with several features on speech signal segments of various lengths and compared the results. In the last part we tested the influence of speaker and gender on the success of emotional states recognition.
Detection of Logopaedic Defects in Speech
Pešek, Milan ; Smékal, Zdeněk (referee) ; Atassi, Hicham (advisor)
The thesis deals with a design and an implementation of software for a detection of logopaedia defects of speech. Due to the need of early logopaedia defects detecting, this software is aimed at a child’s age speaker. The introductory part describes the theory of speech realization, simulation of speech realization for numerical processing, phonetics, logopaedia and basic logopaedia defects of speech. There are also described used methods for feature extraction, for segmentation of words to speech sounds and for features classification into either correct or incorrect pronunciation class. In the next part of the thesis there are results of testing of selected methods presented. For logopaedia speech defects recognition algorithms are used in order to extract the features MFCC and PLP. The segmentation of words to speech sounds is performed on the base of Differential Function method. The extracted features of a sound are classified into either a correct or an incorrect pronunciation class with one of tested methods of pattern recognition. To classify the features, the k-NN, SVN, ANN, and GMM methods are tested.
Usage of advanced signal processing techniques for motor traffic safety enhancement
Beneš, Radek ; Říha, Kamil (referee) ; Atassi, Hicham (advisor)
This diploma thesis deals with the issue of the recognition of road signs in the video sequence. Such systems increase the traffic safety and are implemented by major car factories in the manufactured cars (Opel, BMW). First, the motivation for the utilisation of these systems is presented, followed by the survey of the current state of the art methods. Finally, a specific road-sign detection method is chosen and described in detail. The method uses advanced techniques of signal processing. Segmentation method in color space is used for sign detection and subsequent classification is accomplished by linear classification with optional use of PCA method. In addition, the method contains the prediction of road sign positions based on Kalman filtering. Implemented system yields relatively accurate results and overall analysis and discussion is enclosed.
Computer analysis of sport matches
Židlík, Pavel ; Balík, Miroslav (referee) ; Atassi, Hicham (advisor)
This work deals with the possibility of a fast football match analysis from audio part of record with the possibility of implementation of some methods for other than football matches as well. The first intention was concentrated on detection of whiz of the soccer whistle that has specific frequency in its specter, which is out of common speech frequency. After detection harmonic frequency , the attention was focused on the definition of whiz meaning. Referee was helpful with the issue as he informed me about the number of whiz styles and provided me with referential samples for whiz classification. Neural network with back propagation was used for definition of whiz meaning. Another subject for detection of important moments of the match was concentration on the commentator’s basic tone. In case the commentator is really excited with the match, his basic speech tone automatically intensifies with every important action of the game. Analysis of commentator’s intensified basic speech tone was realized in this work too. Also the national hymns of teams playing against each other are a significant moment of the match. That is why detection of a hymn became another subject of analysis. Advantages of MFCC were used to obtain audio signal feature, from which 20 coefficients were gained. These were used as an entrance for classifier based on neural network with back propagation. For easy usage of these methods a graphic user interface with possibility of well-arranged look on gained results and also with possibility of replaying chosen section was created.
Neural networks in speaker classification
Svoboda, Libor ; Atassi, Hicham (referee) ; Míča, Ivan (advisor)
The content of this work is focused on the neural network per speaker recognition. The work deals with problems of processing speech signal and there are introduction some types of neural network. The part of work was made database of records from speakers with have various sex and ages. The train and test group was made from the database. For classifier were suggested afterwards. One of them was nominated on base Gaussian mixture model and three of them were nominated on neural. This system was tested and analyzed on the basis of age, gender and both criterions each other at the end. Attention is focused on choice suitable feature in each mission of classification at the same time. At the end of work are introduced results of analysis for individual groups and features. The most suitable features are diagnosed from given mission of classification and the most prosperous classifier.
Automatic / Automated recogniton of emotional states based on utterance analysis
Pfeifer, Leon ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis deals with the analysis of human emotional states. The thesis consists of three parts. The first part is charcterize, the process of speech generating, from phonetic and psychological poin of view. In the second part there are proccesed metods and contextual things.(preprocessing of signal, voice activity detector). For calculation fundamental Frequency it was used metod of central clipping, another used metod is formant frequency analyse and the last is metod of determinatin of nuber of thorns and planes. In the thirt part there are proccesesed results of measurements performed by particural metods. It was scorred five different emotional states: neutral, anger, happiness, sadness and surprise. At the end of this part there are discussed results for each metod.
The simulation of biometric protection systems working on the face recognition principle
Dubský, Milan ; Rampl, Ivan (referee) ; Atassi, Hicham (advisor)
The aim of this work is to realize a system in the Matlab-Simulink environment, which will be able to detect and recognize the human face from the input image. The created model will actually simulate the biometric security systems working on the principle of face recognition. The work is divided into two parts. In the first part, several methods for face detection from image are described. We focused on the symptomatic oriented and color segmentation methods. The pattern matching method is also described and implemented; the advantage ofthe pattern matching that it can be used either for face detection or face recognition. The second part of this work contains a description of the face recognition. Where PCA (Principal Component Analysis) are used for this task, this part of the work also includes experimental results of tests performed on our methods.
Identification of emotional state using speech signal analysis
Navrátil, Michal ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis deals with the analysis of human emotional states speaker by the help of analyse speech signals. The thesis has two parts. In the first part, the process of speech generating is described in addition to the description of the commonly used pre-processing methods such as denoising or preemphasis. The first part also deals with the major and minor prosody features, these features are: the fundamental frequency, energy, spectral features and time domain features such as the speech rate. The second part of this thesis deals with a task of emotion recognition from the speech signal. When we accumulate sufficient of the number of recordings emotive state will be able to rekognize emotive state with high probability. All project is prepared for use in real time. The last part of this thesis thesis contains description and results of the experiments made on a large number of speech records.

National Repository of Grey Literature : 88 records found   beginprevious79 - 88  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.