National Repository of Grey Literature 15 records found  1 - 10next  jump to record: Search took 0.01 seconds. 
Establishing speaker's age and sex
Rendek, Tomáš ; Pfeifer, Václav (referee) ; Atassi, Hicham (advisor)
This work deals with speaker´s age and gender recognition. At the beginning it introduces the practical usage of this application and discusses the solutions available. The theoretical part of the thesis specifies the feature extraction and reduction methods and speech databases used in the experiments. The practical part describes the recognizer implemented in the Emotional tool and in two chapters describes the individual experiments. Regarding speaker´s gender estimation; we focused on the impact of the emotional state and speaker's age on the classification process. The two remain experiments were dedicated for general gender estimation performed by using two different classifiers – GMM and k-NN. These two classifiers were used in age estimation as well. In this case, four Group of age was formed and two different feature sets namely: segmental and suprasegmental were exploited four groups
Set of JavaApplets Demonstrations for Speech Processing
Kudr, Michal ; Karafiát, Martin (referee) ; Černocký, Jan (advisor)
The goal of the thesis is being familiar with methods a techniques used in speech processing. Using the obtained knowledge I propose three JavaApplets demonstrating selected methods. In this thesis we can find the theoretical analysis of selected problems.
Automatic / Automated recogniton of emotional states based on utterance analysis
Pfeifer, Leon ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis deals with the analysis of human emotional states. The thesis consists of three parts. The first part is charcterize, the process of speech generating, from phonetic and psychological poin of view. In the second part there are proccesed metods and contextual things.(preprocessing of signal, voice activity detector). For calculation fundamental Frequency it was used metod of central clipping, another used metod is formant frequency analyse and the last is metod of determinatin of nuber of thorns and planes. In the thirt part there are proccesesed results of measurements performed by particural metods. It was scorred five different emotional states: neutral, anger, happiness, sadness and surprise. At the end of this part there are discussed results for each metod.
Modelling Prosodic Dynamics for Speaker Recognition
Jančík, Zdeněk ; Fapšo, Michal (referee) ; Matějka, Pavel (advisor)
Most current automatic speaker recognition system extract speaker-depend features by looking at short-term spectral information. This approach ignores long-term information. I explored approach that use the fundamental frequency and energy trajectories for each speaker. This approach models prosody dynamics on single fonemes or syllables. It is known from literature that prosodic systems do not work as well the acoustic one but it improve the system when fusing. I verified this assumption by fusing my results with state of the art acoustic system from BUT. Data from standard evaluation campaigns organized by National Institute of Standarts and Technology are used for all experiments.
Determining person's height from spoken utterance
Pelikán, Pavel ; Mekyska, Jiří (referee) ; Atassi, Hicham (advisor)
Diploma’s thesis is focused on determining person’s height from spoken utterance. First part of the work evaluates present situation and refers to the published studies. Knowledge gained in these studies was used in this thesis. Study with the best results according to estimated height of the speakers was chosen. The experiment realized in the chosen study was performed in this work. The system for the estimation of the height of the speakers based on the speech signal was created. This system was successfully tested by using several acoustic features on spoken utterances from TIMIT database.
Identification of emotional state using speech signal analysis
Navrátil, Michal ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis deals with the analysis of human emotional states speaker by the help of analyse speech signals. The thesis has two parts. In the first part, the process of speech generating is described in addition to the description of the commonly used pre-processing methods such as denoising or preemphasis. The first part also deals with the major and minor prosody features, these features are: the fundamental frequency, energy, spectral features and time domain features such as the speech rate. The second part of this thesis deals with a task of emotion recognition from the speech signal. When we accumulate sufficient of the number of recordings emotive state will be able to rekognize emotive state with high probability. All project is prepared for use in real time. The last part of this thesis thesis contains description and results of the experiments made on a large number of speech records.
Detection of the voice fundamental frequency
Chloupek, Jiří ; Mekyska, Jiří (referee) ; Sysel, Petr (advisor)
This bachelor thesis deals with the detection of the pitch man. The frequency of the basic tone is one of the basic parameters of speech signal in the frequency domain. In this thesis we describe several methods for pitch detection and practical application of correlation method and cepstral analysis.
Detection of the voice fundamental frequency
Chloupek, Jiří ; Mekyska, Jiří (referee) ; Sysel, Petr (advisor)
This bachelor thesis deals with the detection of the pitch man. The frequency of the basic tone is one of the basic parameters of speech signal in the frequency domain. In this thesis we describe several methods for pitch detection and practical application of correlation method and cepstral analysis.
Precise Detection of Musical Instrument Pitch
Hyrák, Jakub ; Skála, František (referee) ; Černocký, Jan (advisor)
The goal of this Bachelor's thesis is precise analyse of sound signal from musical instrument in real time and detect of fundamental tone. You can find a description of musical theory and methods, which solving this problem. The main part is a description of implementation of resulting application with using chosen method Constant Q transform. The final application may be used to tunning musical instruments.
Set of JavaApplets Demonstrations for Speech Processing
Kudr, Michal ; Karafiát, Martin (referee) ; Černocký, Jan (advisor)
The goal of the thesis is being familiar with methods a techniques used in speech processing. Using the obtained knowledge I propose three JavaApplets demonstrating selected methods. In this thesis we can find the theoretical analysis of selected problems.

National Repository of Grey Literature : 15 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.