keywords:"MFCC" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"MFCC"

Search:

Search Tips :: Advanced Search

Search collections:

Sort by:	Display results:	Output format:

	Simple text-independent voice lock - speaker verification software system Kotulek, Milan ; Dolenský,, Jan (referee) ; Staněk, Miroslav (advisor) A brief introduction into biometrics is described in this thesis leading to description and to design a solution of verification system using speech analysis. The designed system provides firstly basic signal processing, then vowel recognition in fluent Czech speech. For each found vowel, observed speech features are calculated. The created GUI application was tested on created speaker database and its efficiency is approximately 54 % for short testing utterances, and approx. 88 % for long testing utterances respectively. Detailed record
	Robust detection of keywords in speech signal Vrba, Václav ; Sysel, Petr (referee) ; Atassi, Hicham (advisor) The master thesis is divided into two parts theoretical and practical. The theoretical part is focused on methods of analysis and detection of speech signals. In the practical part the system for isolated word recognition was created in Matlab. The system is speaker independent separately for men and women. Also two speech databases were created for further use in the aircraft cockpit. Tests and evaluations were performed even with added noise. Detailed record
	Speech recognition using Sphinx-4 Kryške, Lukáš ; Uher, Václav (referee) ; Burget, Radim (advisor) This diploma thesis is aimed to find an effective method for continuous speech recognition. To be more accurate, it uses speech-to-text recognition for a keyword spotting discipline. This solution is able to be applicable for phone calls analysis or for a similar application. Most of the diploma thesis describes and implements speech recognition framework Sphinx-4 which uses Hidden Markov models (HMM) to define a language acoustic models. It is explained how these models can be trained for a new language or for a new language dialect. Finally there is in detail described how to implement the keyword spotting in the Java language. Detailed record
	Determining person's height from spoken utterance Pelikán, Pavel ; Mekyska, Jiří (referee) ; Atassi, Hicham (advisor) Diploma’s thesis is focused on determining person’s height from spoken utterance. First part of the work evaluates present situation and refers to the published studies. Knowledge gained in these studies was used in this thesis. Study with the best results according to estimated height of the speakers was chosen. The experiment realized in the chosen study was performed in this work. The system for the estimation of the height of the speakers based on the speech signal was created. This system was successfully tested by using several acoustic features on spoken utterances from TIMIT database. Detailed record
	Emotional State Recognition and Classification Based on Speech Signal Analysis Černý, Lukáš ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor) The diploma thesis focuses on classification of emotions. Thesis deals about parameterization of sounds files by suprasegment and segment methods with regard for next used of these methods. Berlin database is used. This database includes many of sounds records with emotions. Parameterization creates files, which are divided to two parts. First part is used for training and second part is used for testing. Point of interest is self-organization network. Thesis includes Matlab´s program which can be used for parameterization of any database. Data are classified by self-organization network after parameterization. Results of hits rates are presented at the end of this diploma thesis. Detailed record
	Logopedic defect analysis and recognition in speech utterances Diviš, Jan ; Atassi, Hicham (referee) ; Smékal, Zdeněk (advisor) This bachelor's thesis deals with logopaedia mistake called dyslalie and its characteristics. I described the process creation and representation of speech. There are presented bases of processing and analyses speech signal ( LPC, cepstral, MFCC). I presented characteristics of speech and calculation of LPC, cepstral and Mel-frequency cepstral coefficients in the programme MATLAB. The bachelor's thesis includes problems of incorrect pronunciation sound "r" and "ř". Detailed record
	Decoder for key word detection system Krotký, Jan ; Míča, Ivan (referee) ; Pfeifer, Václav (advisor) The essay presents the basic characteristics of human speech recognition, describes systems for the detection of key words and further deals with the proposal of each decoder blocks divided into three chapters. The first one describes the operations that are performed before the signal distribution of the framework and the segmentation. The second chapter describes the calculation of short-term energy, the number of zero passes and self-correlative, prediction and Mel-frequency cepstral coefficients. The third chapter, which describes the design of the block decoder, describes the method of dynamic time destruction and the method based on hidden Markov model. The final part of the essay describes decoders working with a speech and a proposal for a simple decoder working with isolated words, which was based issued and tested based on the preceding chapters. Detailed record
	Automatic vocal-oriented recognition of human emotions Houdek, Miroslav ; Přinosil, Jiří (referee) ; Atassi, Hicham (advisor) This master thesis concerns with emotional states and gender recognition on the basis of speech signal analysis. We used various prosodic and cepstral features for the description of the speech signal. In the text we describe non-invasive methods for glottal pulses estimation. The described features of speech were implemented in MATLAB. For their classification we used the GMM classifier, which uses the Gaussian probability distribution for modeling a feature space. Furthermore, we constructed a system for recognition of emotional states of the speaker and a system for gender recognition from speech. We tested the success of created systems with several features on speech signal segments of various lengths and compared the results. In the last part we tested the influence of speaker and gender on the success of emotional states recognition. Detailed record
	Computer analysis of sport matches Židlík, Pavel ; Balík, Miroslav (referee) ; Atassi, Hicham (advisor) This work deals with the possibility of a fast football match analysis from audio part of record with the possibility of implementation of some methods for other than football matches as well. The first intention was concentrated on detection of whiz of the soccer whistle that has specific frequency in its specter, which is out of common speech frequency. After detection harmonic frequency , the attention was focused on the definition of whiz meaning. Referee was helpful with the issue as he informed me about the number of whiz styles and provided me with referential samples for whiz classification. Neural network with back propagation was used for definition of whiz meaning. Another subject for detection of important moments of the match was concentration on the commentator’s basic tone. In case the commentator is really excited with the match, his basic speech tone automatically intensifies with every important action of the game. Analysis of commentator’s intensified basic speech tone was realized in this work too. Also the national hymns of teams playing against each other are a significant moment of the match. That is why detection of a hymn became another subject of analysis. Advantages of MFCC were used to obtain audio signal feature, from which 20 coefficients were gained. These were used as an entrance for classifier based on neural network with back propagation. For easy usage of these methods a graphic user interface with possibility of well-arranged look on gained results and also with possibility of replaying chosen section was created. Detailed record
	Speech Recognition (digit) Kantar, Martin ; Minář, Petr (referee) ; Matoušek, Radomil (advisor) The aim of this diploma thesis is to explain what speech is and what are its constituents. I mention commonly used methods which are used for preparation of signals which we use for recognition. Schematic examples show principles of current recognizers of speech, their advantages and disadvantages. I made speech recognition program for 0-9 numerals in Matlab for neural nets learning. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English