National Repository of Grey Literature 88 records found  1 - 10nextend  jump to record: Search took 0.00 seconds. 
Emotion Recognition from Acted and Spontaneous Speech
Atassi, Hicham ; Přibil, Jiří (referee) ; Zahradník, Pavel (referee) ; Smékal, Zdeněk (advisor)
Dizertační práce se zabývá rozpoznáním emočního stavu mluvčích z řečového signálu. Práce je rozdělena do dvou hlavních častí, první část popisuju navržené metody pro rozpoznání emočního stavu z hraných databází. V rámci této části jsou představeny výsledky rozpoznání použitím dvou různých databází s různými jazyky. Hlavními přínosy této části je detailní analýza rozsáhlé škály různých příznaků získaných z řečového signálu, návrh nových klasifikačních architektur jako je například „emoční párování“ a návrh nové metody pro mapování diskrétních emočních stavů do dvou dimenzionálního prostoru. Druhá část se zabývá rozpoznáním emočních stavů z databáze spontánní řeči, která byla získána ze záznamů hovorů z reálných call center. Poznatky z analýzy a návrhu metod rozpoznání z hrané řeči byly využity pro návrh nového systému pro rozpoznání sedmi spontánních emočních stavů. Jádrem navrženého přístupu je komplexní klasifikační architektura založena na fúzi různých systémů. Práce se dále zabývá vlivem emočního stavu mluvčího na úspěšnosti rozpoznání pohlaví a návrhem systému pro automatickou detekci úspěšných hovorů v call centrech na základě analýzy parametrů dialogu mezi účastníky telefonních hovorů.
Establishing speaker's age and sex
Rendek, Tomáš ; Pfeifer, Václav (referee) ; Atassi, Hicham (advisor)
This work deals with speaker´s age and gender recognition. At the beginning it introduces the practical usage of this application and discusses the solutions available. The theoretical part of the thesis specifies the feature extraction and reduction methods and speech databases used in the experiments. The practical part describes the recognizer implemented in the Emotional tool and in two chapters describes the individual experiments. Regarding speaker´s gender estimation; we focused on the impact of the emotional state and speaker's age on the classification process. The two remain experiments were dedicated for general gender estimation performed by using two different classifiers – GMM and k-NN. These two classifiers were used in age estimation as well. In this case, four Group of age was formed and two different feature sets namely: segmental and suprasegmental were exploited four groups
Speech-signal-based recognition of type of transmission channel
Kopřiva, Tomáš ; Burget, Radim (referee) ; Atassi, Hicham (advisor)
This work deals with the classification of five different transmission channels by speech signal processing. The channels considered are: GSM, two PSTN channels and two VoIP channels. For the training and testing purposes, a speech database for the transmission channels called SPLAB_TranCh was constructed. The speech signals of this corpus originally come from well-known TIMIT database, where each utterance passed through each mentioned transmission channel. The main objective of this work is to find optimal features and classification accuracy that yield best classification accuracy. Several types of features, including MFCC, LPCC and spectral characteristics were put under examination. The best suprasegmental features were identified by using mRMR algorithm. Several classifiers were tested as well. The results suggested that the classification of transmission channel can be performed with high accuracy (around 90 %). Influence of adverse effects, which can occur during transmission, is also examined. Considered types of distortions are: saturation, thresholding, echo, crackling noises and different colors of noises and filters.
Czech phonetic transcription
Zedek, Martin ; Atassi, Hicham (referee) ; Sysel, Petr (advisor)
The aim is to create a script for automatic transcription of Czech language phonetic forward and backward. The thesis compares the different phonetic alphabet and describes their advantages and disadvantages. For this thesis was eventually selected Czech Phonetic Alphabet (ČFA). The following are the most important rules for connection of vowels and consonants and the main assimilation rules. The script was created in MATLAB environment. The function is to convert the phonetic form using the rules described. Backward conversion is not fully debugged and it is necessary to refine the result using Aspell program for correcting spelling.
Advanced analysis of spontaneous spoken dialogues
Musil, David ; Burget, Radim (referee) ; Atassi, Hicham (advisor)
First part of this thesis contains information about call centers. This chapter focuses at their origin and history and basic division of software call centers use. Second chapter contains ways of processing speech and methods for call quality evaluation. Next chapter provides a description of basic dialogue features such as reaction, hesitation and interruption. Following chapters contain desing and description of graphic environment designed in Matlab programming language for previously mentioned basic dialogue features from data provided by thesis supervisor and their use for conversation analysis. Second part of thesis describes the process of creation a call assessment. It includes gathering necessary data for this part of thesis, their subsequent modification, usage and discussion of results and foundings.
Speaker recognition
Kašpar, Ladislav ; Atassi, Hicham (referee) ; Sysel, Petr (advisor)
My bachelor thesis is devoted to the problem of speaker recognition. It includes the basic theory on this topic. The theory focuses on the calculation of parameters for speaker recognition and description of the procedure for speaker recognition. An application for speaker recognition has been written in Matlab. It uses techniques as frequency formants, cepstral coefficients and segmentation of the signal as the main parameters.
Emotional State Recognition Based on Speech Signal Analysis
Čermák, Jan ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The thesis is focused on the emotional states classification in the Matlab program, using neural networks and the classifier which is based on a combination of Gaussian density functions. It deals with the speech signal processing; the prosodic and spectral signs and the MFCC coefficients were extracted from the signal. The work also deals with the quality evaluation of individual signs of which the most suitable were chosen in order to provide the correct classification of emotional states. In order to identify the emotional states, two different methods were used. The first method of classification was the use of neural networks with differently selected parameters, and the second method was the use of the Gaussian mixture model (GMM). In both methods, a database of emotional utterances was divided into the training group and the test group. The testing was based on a method independent of the speaker. The work also includes the comparison of individual analyzed methods as well as the representation and comparison of the results. The conclusion comprises a proposition for the best parameters and the best classifier for the recognition of the speaker’s emotional state.
Videoconferencing application integrated in web browser
Mucha, Ján ; Atassi, Hicham (referee) ; Číka, Petr (advisor)
Bachelor thesis is focused on integrating videoconferencing application in the web browser. Implementation is realized using HTML5 and WebRTC tool, designed by web browsers makers such as Google and Firefox. To work with WebRTC is used API (Application Programming Interface) sipML5, which is implemented using JavaScript. This API contains package of functions and classes used to create videoconference application using simple commands. For signaling is used SIP protocol. Client application provides communication point-to-point, chat and multipoint communication. Multipoint communication was tested on the server application enabling the creation and management of videoconferencing rooms. For this was used software MCU Media Server and MediaMixer Server developed by Medooze company.
Usage of the MATLAB environment for neural networks
Lenk, Peter ; Atassi, Hicham (referee) ; Škorpil, Vladislav (advisor)
This bachelor thesis discusses the basic theory and modelling of neural networks in the software environment of MATLAB. The thesis can be divided into four parts. After an introduction into the thesis, the theoretical background of the neural netwoks is explained in the first chapter. This chapter features a brief history and a biological background of neural networks and deals with the basic network architectures and the training processes. The next part is the description of how to implement networks in a general way using the MATLAB enviroment, so it deals with preparation of data, creation, simulation and training of a neural network. The last part of the paper covers a design of two excersises created in order to introduce modelling of the neural networks in the MATLAB enviroment to the students.
Emotional State Recognition and Classification Based on Speech Signal Analysis
Černý, Lukáš ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis focuses on classification of emotions. Thesis deals about parameterization of sounds files by suprasegment and segment methods with regard for next used of these methods. Berlin database is used. This database includes many of sounds records with emotions. Parameterization creates files, which are divided to two parts. First part is used for training and second part is used for testing. Point of interest is self-organization network. Thesis includes Matlab´s program which can be used for parameterization of any database. Data are classified by self-organization network after parameterization. Results of hits rates are presented at the end of this diploma thesis.

National Repository of Grey Literature : 88 records found   1 - 10nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.