National Repository of Grey Literature 39 records found  beginprevious30 - 39  jump to record: Search took 0.00 seconds. 
Voice Conversion
Hodaň, David ; Novotný, Ondřej (referee) ; Černocký, Jan (advisor)
Voice conversion is the process of transformation of speech parameters belonging to one speaker in such a way that his/her speech sounds as spoken by someone else. This thesis presents a short summary of several techniques currently used for conversion. First, the theory of voice creation with an emphasis on key atributes that characterize and identify a speaker’s voice is described. Methods for voice modification are discussed, together with the advantages and pitfalls that predetermine the use-cases for suitable application of these methods. A high-level overview of how speech is transformed between the source and the target speakers is presented. This description is subsequently used to design a voice conversion system that is aimed to demonstrate one of the possible approaches to the conversion problem. The process of conversion consists of two phases: training and synthesis. As part of this project, a computer program for voice conversion based on the MATLAB programming environment has been developed. Its design, implementation and results are discussed.
Analysis of Parkinson's disease using segmental speech parameters
Mračko, Peter ; Mekyska, Jiří (referee) ; Smékal, Zdeněk (advisor)
This project describes design of the system for diagnosis Parkinson’s disease based on speech. Parkinson’s disease is a neurodegenerative disorder of the central nervous system. One of the symptoms of this disease is disability of motor aspects of speech, called hypokinetic dysarthria. Design of the system in this work is based on the best known segmental features such as coefficients LPC, PLP, MFCC, LPCC but also less known such as CMS, ACW and MSC. From speech records of patients affected by Parkinson’s disease and also healthy controls are calculated these coefficients, further is performed a selection process and subsequent classification. The best result, which was obtained in this project reached classification accuracy 77,19%, sensitivity 74,69% and specificity 78,95%.
Simple text-independent voice lock - speaker verification software system
Kotulek, Milan ; Dolenský,, Jan (referee) ; Staněk, Miroslav (advisor)
A brief introduction into biometrics is described in this thesis leading to description and to design a solution of verification system using speech analysis. The designed system provides firstly basic signal processing, then vowel recognition in fluent Czech speech. For each found vowel, observed speech features are calculated. The created GUI application was tested on created speaker database and its efficiency is approximately 54 % for short testing utterances, and approx. 88 % for long testing utterances respectively.
Determining person's height from spoken utterance
Pelikán, Pavel ; Mekyska, Jiří (referee) ; Atassi, Hicham (advisor)
Diploma’s thesis is focused on determining person’s height from spoken utterance. First part of the work evaluates present situation and refers to the published studies. Knowledge gained in these studies was used in this thesis. Study with the best results according to estimated height of the speakers was chosen. The experiment realized in the chosen study was performed in this work. The system for the estimation of the height of the speakers based on the speech signal was created. This system was successfully tested by using several acoustic features on spoken utterances from TIMIT database.
Multimedia signal processing
Staněk, Miroslav ; Pospíšil, Radek (referee) ; Sigmund, Milan (advisor)
The aim of this thesis is creation the appropriate multimedia support for signals and system with continuous time. The understanding of this issue is very important, because the obligatory subject Signals and systems, exactly BSIS, is taught at the EST bachelor degree. The understanding is also necessary prerequisite to successful understanding next topics in other related subjects. The next part of this thesis is focused on one dimension discrete signals. Concretely, the aim of this part is a realization of software system. Designed system has some basic operations (the signal energy, the number of signal zero crossing etc.) with sound files and also some advance functions e.g. vowel seeking and separating in fluent speech. The system is divided into two main parts. The first one analyzes sound files, creates the new sound file with wanted vowel and matrices with important parameters for other processing. The second program computes with given data, which statistically evaluates in other steps. The final system can be useful for speaker recognition, his emotional status etc.
Comparison of voice and audio codecs
Lúdik, Michal ; Sysel, Petr (referee) ; Míča, Ivan (advisor)
This thesis deals with description of human hearing, audio and speech codecs, description of objective measure of quality and practical comparison of codecs. Chapter about audio codecs consists of description of lossless codec FLAC and lossy codecs MP3 and Ogg Vorbis. In chapter about speech codecs is description of linear predictive coding and G.729 and OPUS codecs. Evaluation of quality consists of description of segmental signal-to- noise ratio and perceptual evaluation of quality – WSS and PESQ. Last chapter deals with description od practical part of this thesis, that is comparison of memory and time consumption of audio codecs and perceptual evaluation of speech codecs quality.
Logopedic defect analysis and recognition in speech utterances
Diviš, Jan ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor)
This bachelor's thesis deals with logopaedia mistake called dyslalie and its characteristics. I described the process creation and representation of speech. There are presented bases of processing and analyses speech signal ( LPC, cepstral, MFCC). I presented characteristics of speech and calculation of LPC, cepstral and Mel-frequency cepstral coefficients in the programme MATLAB. The bachelor's thesis includes problems of incorrect pronunciation sound "r" and "ř".
Decoder for key word detection system
Krotký, Jan ; Míča, Ivan (referee) ; Pfeifer, Václav (advisor)
The essay presents the basic characteristics of human speech recognition, describes systems for the detection of key words and further deals with the proposal of each decoder blocks divided into three chapters. The first one describes the operations that are performed before the signal distribution of the framework and the segmentation. The second chapter describes the calculation of short-term energy, the number of zero passes and self-correlative, prediction and Mel-frequency cepstral coefficients. The third chapter, which describes the design of the block decoder, describes the method of dynamic time destruction and the method based on hidden Markov model. The final part of the essay describes decoders working with a speech and a proposal for a simple decoder working with isolated words, which was based issued and tested based on the preceding chapters.
Automatic vocal-oriented recognition of human emotions
Houdek, Miroslav ; Přinosil, Jiří (referee) ; Atassi, Hicham (advisor)
This master thesis concerns with emotional states and gender recognition on the basis of speech signal analysis. We used various prosodic and cepstral features for the description of the speech signal. In the text we describe non-invasive methods for glottal pulses estimation. The described features of speech were implemented in MATLAB. For their classification we used the GMM classifier, which uses the Gaussian probability distribution for modeling a feature space. Furthermore, we constructed a system for recognition of emotional states of the speaker and a system for gender recognition from speech. We tested the success of created systems with several features on speech signal segments of various lengths and compared the results. In the last part we tested the influence of speaker and gender on the success of emotional states recognition.
Speech Recognition (digit)
Kantar, Martin ; Minář, Petr (referee) ; Matoušek, Radomil (advisor)
The aim of this diploma thesis is to explain what speech is and what are its constituents. I mention commonly used methods which are used for preparation of signals which we use for recognition. Schematic examples show principles of current recognizers of speech, their advantages and disadvantages. I made speech recognition program for 0-9 numerals in Matlab for neural nets learning.

National Repository of Grey Literature : 39 records found   beginprevious30 - 39  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.