National Repository of Grey Literature 240 records found  previous11 - 20nextend  jump to record: Search took 0.01 seconds. 
Linear prediciton and cepstral synthesis of speech signal in the TTS system
Mekyska, Jiří ; Stejskal, Vojtěch (referee) ; Smékal, Zdeněk (advisor)
This work deals with a linear prediction and cepstral synthesis of speech signal in the TTS (Text-to-Speech) systems with the opportunity of modeling the prosody. The work contains a description of speech signal in acoustic and phonetic plane, the principle of speech production and the way we can figure the speech signal in time and frequency domain. Next, there is the TTS block structure mentioned, whereas each block has its own detailed description. In the work, the modeling of prosody using the three most important suprasegmental features (fundamental tone, continuation and speech intensity) is also described. At the end of this work, there is a design and realization of universal Czech TTS system which is based on the speech synthesis in frequency domain. This system is implemented in program MATLAB.
Modification of Speech Rate
Kovářík, Aleš ; Schwarz, Petr (referee) ; Szőke, Igor (advisor)
This diploma thesis discusses modification of a speech rate. The PSOLA (Pitch Synchronous OverLap Add) method was used for the rate modification. This algorithm works in time domain. Another method -- phase vocoder, which works in frequency domain is also presented in an overview. This thesis extends the PSOLA method with a phoneme recognition, which allows for better understandability of the speech output by considering characteristics of the phonemes beeing pronounced. To examine this proposed method, an application connecting PSOLA and a phoneme recognizer was developed.
Database of vocal samples of human emotions
Hlavica, Michal ; Přinosil, Jiří (referee) ; Atassi, Hicham (advisor)
In this bachelor work is analyzed theory of emotions, how emotions arise and how they are physiologically expressed by human body. How these physiological expressions and emotions reflect into the human speech. Then is described process of creating of speech and basic prosodic and acoustic parameters relevant for research. Theory of creating of databases is described here as well, which is quality ground for database itself. The database is also part of this thesis and they are records cut from television programmes and serials. The next very important issue is description of software tool for subjective evaluating of databases, which was created as a part of this thesis. It was created in C++ language with help by compiler Builder C++ . Also a short analysis of exemplary records for every emotion is done here. This analysis deals with basic frequency, intensity and first three formants.
LPC Speech Coding
Zapletal, Ondřej ; Kyselý, František (referee) ; Rajmic, Pavel (advisor)
The contents of the thesis "LPC speech coding" are studies of this method of a parametric source coding, explanation of mathematical procedures that are used in it (linear prediction, autocorrelation, Levinson-Durbin algorithm, transfer to a form suitable for transmission, Chebyshev root searching polynomial method) and acquaintance with the signification and application of that method in real speech encoders. The task of the original project of this thesis is a description and simulation of a simple speech encoder based on LPC, which transforms a real speech signal into a bit flow, which contains all of the significant parameters for its backward reconstruction (LSF coefficients, pitch period, excitation level, voice detection - AMDF method). One part of this thesis is a discussion about currently used speech encoders.
Agreements and Disagreements between Automatic and Human Speaker Recognition
Valenta, Jakub ; Matějka, Pavel (referee) ; Rohdin, Johan Andréas (advisor)
Tato práce se zabývá problémem rozpoznáváním mluvčího. Uvedený pojem je definován a doplněn o jednotlivé metody, které s ním souvisí. Cílem práce je poukázat na shody a rozdíly mezi lidským a automatickým procesem rozpoznávání mluvčího. V úvodu práce jsou popsány teoretické poznatky z obou zmíněných oblastí, tj. na jaké aspekty lidské řeči se zaměřuje člověk, resp. automatický systém. Následně je provedeno několik experimentů, které mají za úkol srovnat tyto dvě metody. Tyto experimenty jsou vyhodnoceny tak, že je možné pozorovat, které testovací úlohy dokáže lépe vyřešit člověk, aby následně bylo možné tyto poznatky použít ke zlepšení funkce automatického systému. V závěru práce je takovýto návrh na zlepšení automatického systému předveden a otestován. Testování proběhlo úspěšně a byla zaznamenána vyšší přesnost při vyhodnocování. Takový výsledek tedy může být užitý v dalších výzkumech a umožnit tak další vývoj v oblasti automatického rozpoznávání mluvčích.
Set of JavaApplets Demonstrations for Speech Processing
Kudr, Michal ; Karafiát, Martin (referee) ; Černocký, Jan (advisor)
The goal of the thesis is being familiar with methods a techniques used in speech processing. Using the obtained knowledge I propose three JavaApplets demonstrating selected methods. In this thesis we can find the theoretical analysis of selected problems.
Identification of persons via voice imprint
Mekyska, Jiří ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
This work deals with the text-dependent speaker recognition in systems, where just a few training samples exist. For the purpose of this recognition, the voice imprint based on different features (e.g. MFCC, PLP, ACW etc.) is proposed. At the beginning, there is described the way, how the speech signal is produced. Some speech characteristics important for speaker recognition are also mentioned. The next part of work deals with the speech signal analysis. There is mentioned the preprocessing and also the feature extraction methods. The following part describes the process of speaker recognition and mentions the evaluation of the used methods: speaker identification and verification. Last theoretically based part of work deals with the classifiers which are suitable for the text-dependent recognition. The classifiers based on fractional distances, dynamic time warping, dispersion matching and vector quantization are mentioned. This work continues by design and realization of system, which evaluates all described classifiers for voice imprint based on different features.
Identification of significant spectral components in speach signal in stress
Dulesov, Egor ; Tučková, Jana (referee) ; Poměnková, Jitka (advisor)
The aim of this master’s thesis is to learn the problem of analysis and identification of significant spectral components in speech signal. Based on learning a special literature chooses the suitable methods of spectrum estimate. Does learning the literature in specification of testing of spectral components significate. Makes a procedure for identification of chosen speech formants. Does this procedure for audio signals both of in stress and in normal state. Estimates the results, compares efficiency of chosen methods and determine threshold for chosen formant of analyzed stress signal. States the recommendations for speech spectral analysis in stress situation.
Speech segmentation into phonemes
Andrla, Petr ; Balík, Miroslav (referee) ; Sysel, Petr (advisor)
The programme for the segmentation of a speech into fonems was created as a part of the bachelor´s thesis. This programme was made in the programme Matlab and consists of several scripts. The programme serves for automatic and hand segmentation. Automatic segmentation is based on the method of following symptom. The audiorecords were elaborated by the programme and a operation of the automatic segmentation was analysed. A detailed manual was created to the programme too. Individual used methods of the elaboration of a speech were in the bachelor´s thesis briefly descripted, its implementations in the programme and reasons of set of its parameters.
Automatic / Automated recogniton of emotional states based on utterance analysis
Pfeifer, Leon ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
The diploma thesis deals with the analysis of human emotional states. The thesis consists of three parts. The first part is charcterize, the process of speech generating, from phonetic and psychological poin of view. In the second part there are proccesed metods and contextual things.(preprocessing of signal, voice activity detector). For calculation fundamental Frequency it was used metod of central clipping, another used metod is formant frequency analyse and the last is metod of determinatin of nuber of thorns and planes. In the thirt part there are proccesesed results of measurements performed by particural metods. It was scorred five different emotional states: neutral, anger, happiness, sadness and surprise. At the end of this part there are discussed results for each metod.

National Repository of Grey Literature : 240 records found   previous11 - 20nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.