National Repository of Grey Literature 42 records found  previous11 - 20nextend  jump to record: Search took 0.01 seconds. 
The analysis of the clarinet spectrum from different manufacturers
Suchánek, Tomáš ; Mojdl, Edgar (referee) ; Jirásek, Ondřej (advisor)
Bachelor’s thesis focuses on spectral analysis of six B clarinets made by manufacturers Buffet Crampon, RZ Woodwind Manufacturing and Yamaha. Instruments were tested by two professional musicians with different timbral preferences and the resulting spectrum is then applied to how psychoacoustic measurements define timbre perception. Furthermore, the impact of different dynamics or reeds is discussed and significant part of analysis also describes directivity patterns of individual higher harmonics or characteristic formant areas.
Decoder for key word detection system
Krotký, Jan ; Míča, Ivan (referee) ; Pfeifer, Václav (advisor)
The essay presents the basic characteristics of human speech recognition, describes systems for the detection of key words and further deals with the proposal of each decoder blocks divided into three chapters. The first one describes the operations that are performed before the signal distribution of the framework and the segmentation. The second chapter describes the calculation of short-term energy, the number of zero passes and self-correlative, prediction and Mel-frequency cepstral coefficients. The third chapter, which describes the design of the block decoder, describes the method of dynamic time destruction and the method based on hidden Markov model. The final part of the essay describes decoders working with a speech and a proposal for a simple decoder working with isolated words, which was based issued and tested based on the preceding chapters.
Simple text-independent voice lock - speaker verification software system
Kotulek, Milan ; Dolenský,, Jan (referee) ; Staněk, Miroslav (advisor)
A brief introduction into biometrics is described in this thesis leading to description and to design a solution of verification system using speech analysis. The designed system provides firstly basic signal processing, then vowel recognition in fluent Czech speech. For each found vowel, observed speech features are calculated. The created GUI application was tested on created speaker database and its efficiency is approximately 54 % for short testing utterances, and approx. 88 % for long testing utterances respectively.
Comparison of voice and audio codecs
Lúdik, Michal ; Sysel, Petr (referee) ; Míča, Ivan (advisor)
This thesis deals with description of human hearing, audio and speech codecs, description of objective measure of quality and practical comparison of codecs. Chapter about audio codecs consists of description of lossless codec FLAC and lossy codecs MP3 and Ogg Vorbis. In chapter about speech codecs is description of linear predictive coding and G.729 and OPUS codecs. Evaluation of quality consists of description of segmental signal-to- noise ratio and perceptual evaluation of quality – WSS and PESQ. Last chapter deals with description od practical part of this thesis, that is comparison of memory and time consumption of audio codecs and perceptual evaluation of speech codecs quality.
Multimedia signal processing
Staněk, Miroslav ; Pospíšil, Radek (referee) ; Sigmund, Milan (advisor)
The aim of this thesis is creation the appropriate multimedia support for signals and system with continuous time. The understanding of this issue is very important, because the obligatory subject Signals and systems, exactly BSIS, is taught at the EST bachelor degree. The understanding is also necessary prerequisite to successful understanding next topics in other related subjects. The next part of this thesis is focused on one dimension discrete signals. Concretely, the aim of this part is a realization of software system. Designed system has some basic operations (the signal energy, the number of signal zero crossing etc.) with sound files and also some advance functions e.g. vowel seeking and separating in fluent speech. The system is divided into two main parts. The first one analyzes sound files, creates the new sound file with wanted vowel and matrices with important parameters for other processing. The second program computes with given data, which statistically evaluates in other steps. The final system can be useful for speaker recognition, his emotional status etc.
Analysis of Parkinson's disease using segmental speech parameters
Mračko, Peter ; Mekyska, Jiří (referee) ; Smékal, Zdeněk (advisor)
This project describes design of the system for diagnosis Parkinson’s disease based on speech. Parkinson’s disease is a neurodegenerative disorder of the central nervous system. One of the symptoms of this disease is disability of motor aspects of speech, called hypokinetic dysarthria. Design of the system in this work is based on the best known segmental features such as coefficients LPC, PLP, MFCC, LPCC but also less known such as CMS, ACW and MSC. From speech records of patients affected by Parkinson’s disease and also healthy controls are calculated these coefficients, further is performed a selection process and subsequent classification. The best result, which was obtained in this project reached classification accuracy 77,19%, sensitivity 74,69% and specificity 78,95%.
Comparison of spectrum and directional dharacteristics of double reed instruments
Cočev, Jiří ; Buzzi, Mario (referee) ; Jirásek, Ondřej (advisor)
The Bachelor's thesis deals with analysis of the sound signals of double reed wind musical instruments. Research focuses mainly on properties of the reed and how properties of reed affects the overall sound of the instrument. For the description of the signals were used FFT, LPC, Autocorrelation and Cepstral analysis. In conclusion, the thesis offers possible future direction for the experimental research of double reed wind musical instruments.
Estimation of formant frequencies using machine learning
Káčerová, Erika ; Galáž, Zoltán (referee) ; Mekyska, Jiří (advisor)
This Master's thesis deals with the issue of formant extraction. A system of scripts in Matlab interface is created to generate values of the first three formant frequencies from speech recordings with the use of Praat and Snack(WaveSurfer). Mel Frequency Cepstral Coefficients and Linear Predictive Coefficients are extracted from the audio files in order to be added to the database. This database is then used to train a neural network. Finally, the designed neural network is tested.
Codec Detection from Speech
Jon, Josef ; Matějka, Pavel (referee) ; Černocký, Jan (advisor)
Tato práce se zabývá detekcí kodeků z komprimovaného řečového signálu. Cílem bylo zjistit, jaké charakteristiky rozlišují jednotlivé kodeky a následně vytvořit prostředí vhodné pro experimenty s různými typy a konfiguracemi klasifikátorů. Použity byly Support vector machines a především neuronové sítě, které byly vytvořeny pomocí nástroje Keras. Hlavním přínosem této práce je experimentální část, ve které je analyzován vliv různých parametrů neuronové sítě. Po nalezení nejvhodnější kombinace parametrů dosáhla síť přesnosti klasifikace přes 98% na testovací sadě obsahující data z 6 kodeků.
Determining person's height from spoken utterance
Pelikán, Pavel ; Mekyska, Jiří (referee) ; Atassi, Hicham (advisor)
Diploma’s thesis is focused on determining person’s height from spoken utterance. First part of the work evaluates present situation and refers to the published studies. Knowledge gained in these studies was used in this thesis. Study with the best results according to estimated height of the speakers was chosen. The experiment realized in the chosen study was performed in this work. The system for the estimation of the height of the speakers based on the speech signal was created. This system was successfully tested by using several acoustic features on spoken utterances from TIMIT database.

National Repository of Grey Literature : 42 records found   previous11 - 20nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.