keywords:"MFCC" - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: keywords:"MFCC"

Search:

Search Tips :: Advanced Search

Search collections:

Sort by:	Display results:	Output format:

	Robot with autonomous audio-video control Dvořáček, Štěpán ; Mašek, Jan (referee) ; Přinosil, Jiří (advisor) This thesis describes the design and realization of a mobile robot with autonomous audio-visual control. This robot is able of movement based on sensors consisting of camera and microphone. The mechanical part consists of components made with 3D print technology and omnidirectional Mecanum wheels. Software utilizes OpenCV library for image processing and computes MFCC a DTW for voice command detection. Detailed record
	Estimation of formant frequencies using machine learning Káčerová, Erika ; Galáž, Zoltán (referee) ; Mekyska, Jiří (advisor) This Master's thesis deals with the issue of formant extraction. A system of scripts in Matlab interface is created to generate values of the first three formant frequencies from speech recordings with the use of Praat and Snack(WaveSurfer). Mel Frequency Cepstral Coefficients and Linear Predictive Coefficients are extracted from the audio files in order to be added to the database. This database is then used to train a neural network. Finally, the designed neural network is tested. Detailed record
	Controlling and Measuring Sport Drills by Voice/Sound Odehnal, Jiří ; Křivka, Zbyněk (referee) ; Rychlý, Marek (advisor) This master's thesis deals with the design and development of mobile aplication for Android platform. The aim of the work is to implement a simple and user-friendly user interface that would support and assist the user in trainning and sport exercises. The thesis also include implementation of sound detection to support during exercises and voice instruction by application. In practice the application should help in making training exercises more comfortable without the user being forced to keep mobile device in hand. Detailed record
	Acoustic Scene Classification from Speech Dobrotka, Matúš ; Glembek, Ondřej (referee) ; Matějka, Pavel (advisor) The topic of this thesis is an audio recording classification with 15 different acoustic scene classes that represent common scenes and places where people are situated on a regular basis. The thesis describes 2 approaches based on GMM and i-vectors and a fusion of the both approaches. The score of the best GMM system which was evaluated on the evaluation dataset of the DCASE Challenge is 60.4%. The best i-vector system's score is 68.4%. The fusion of the GMM system and the best i-vector system achieves score of 69.3%, which would lead to the 20th place in the all systems ranking of the DCASE 2017 Challenge (among 98 submitted systems from all over the world). Detailed record
	Speech-signal-based recognition of type of transmission channel Kopřiva, Tomáš ; Burget, Radim (referee) ; Atassi, Hicham (advisor) This work deals with the classification of five different transmission channels by speech signal processing. The channels considered are: GSM, two PSTN channels and two VoIP channels. For the training and testing purposes, a speech database for the transmission channels called SPLAB_TranCh was constructed. The speech signals of this corpus originally come from well-known TIMIT database, where each utterance passed through each mentioned transmission channel. The main objective of this work is to find optimal features and classification accuracy that yield best classification accuracy. Several types of features, including MFCC, LPCC and spectral characteristics were put under examination. The best suprasegmental features were identified by using mRMR algorithm. Several classifiers were tested as well. The results suggested that the classification of transmission channel can be performed with high accuracy (around 90 %). Influence of adverse effects, which can occur during transmission, is also examined. Considered types of distortions are: saturation, thresholding, echo, crackling noises and different colors of noises and filters. Detailed record
	System for speaker diarization Bradáč, Josef ; Atassi, Hicham (referee) ; Míča, Ivan (advisor) Speaker diarization system has wide application in the field of processing and analysis speech signals. This work is broken down to introduction and follow for designing the system. Result of this work is an implementation of the system itself and its evaluation based on interview´s database. Detailed record
	Application of statistical analysis of speech in patients with Parkinson's disease Bijota, Jan ; Mžourek, Zdeněk (referee) ; Galáž, Zoltán (advisor) This thesis deals with speech analysis of people who suffer from Parkinson’s disease. Purpose of this thesis is to obtain statistical sample of speech parameters which helps to determine if examined person is suffering from Parkinson’s disease. Statistical sample is based on hypokinetic dysarthria detection. For speech signal pre-processing DC-offset removal and pre-emphasis are used. The next step is to divide signal into frames. Phonation parameters, MFCC and PLP coefficients are used for characterization of framed speech signal. After parametrization the speech signal can be analyzed by statistical methods. For statistical analysis in this thesis Spearman’s and Pearson’s correlation coefficients, mutual information, Mann-Whitney U test and Student’s t-test are used. The thesis results are the groups of speech parameters for individual long czech vowels which are the best indicator of the difference between healthy person and patient suffering from Parkinson’s disease. These result can be helpful in medical diagnosis of a patient. Detailed record
	Speech Recognition Algorithms in FPGA/DSP Urbiš, Oldřich ; Herout, Adam (referee) ; Szőke, Igor (advisor) This master's thesis deals with design of speech recognition algorithms with consideration of target technology, which is platform combinating digital signal processing and field programmable gate array. Algorithms for speech recognition includes: feature extraction of Melfrequency cepstral coefficients, hidden Markov models and their evaluation by Viterbi algorithm. Detailed record
	Multiplatform Application for Speaker Verification Görig, Jan ; Matějka, Pavel (referee) ; Glembek, Ondřej (advisor) Bachelor thesis considers speaker recognition without knowledge of spoken message. There are described current feature extraction methods and their evaluation using Gaussian mixture model. The practical output of this work is application for visualization of the recognition process. Developed application is cross platform and it uses Qt and BSAPI libraries. Detailed record
	Speech Recognition For Selected Languages Schmitt, Jan ; Karafiát, Martin (referee) ; Janda, Miloš (advisor) This bachelor's thesis deals with recognition of continues speech for three languages - Bulgarian, Croatian and Swedish. There are described basics of speech processing and recognition methods like acoustic modeling using hidden Markov models and gaussian mixture models. Another aim of this work is preparing data for those languages from GlobalPhone database, so they may be used with speech recognition toolkits Kaldi and HTK. With data prepared there are several models trained and tested using Kaldi toolkit. Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English