National Repository of Grey Literature: 22 records found, showing records 1 - 10
The relation of emotions and intonation curves
Gavlasová, Radka ; Smékal, Zdeněk (referee) ; Tučková, Jana (advisor)
This thesis deals with intonation curves and their relation to human emotions. The theoretical part covers speech production, signal processing and the psychological classification of emotions. A unique database was also recorded with the help of two professional actors. The main goal is to classify the recorded data with artificial neural networks into four classes: anger, joy, boredom and sadness. The practical part was implemented in Matlab using the Classification Learner app. The features used were variations of the fundamental frequency and MFCC. The results were compared with a listening survey to determine whether the neural network's results correspond to human judgment. The trained models reached a success rate of 82 %, and testing on new data reached 75 %. The listening survey confirmed that the results are consistent with human perception. A larger set of higher-quality data would likely improve the success rate.
Speech signal processing in time domain
Marko, Ján ; Staněk, Miroslav (referee) ; Sigmund, Milan (advisor)
This bachelor's project deals with the processing of speech signals. The work includes a survey of available publications on determining voiced segments of speech and the fundamental frequency. Attention is devoted to time-domain methods for obtaining speech recognition features. The methods are evaluated by comparing theoretical and practical results on real speech signals, highlighting their advantages and disadvantages.
Voice Analysis for Detection of Diseases
Chytil, Pavel ; Sigmund, Milan (advisor)
This dissertation focuses on the analysis of the speech signal for the purpose of detecting diseases that affect the structure of the vocal organs, especially those that alter the structural character of the vocal folds. An overview of current techniques is provided. The sources of the recordings used for healthy and ill speakers are then described. The main purpose of this dissertation is to describe a computational procedure for estimating the parameters of a voice source model, which enable the subsequent detection and classification of vocal fold diseases. We provide a detailed description of the analysis of speech signals that can be derived from parametric models of the vocal folds.
Real-Time Analysis of Audio Signals
Řezáč, Martin ; Schimmel, Jiří (referee) ; Černocký, Jan (advisor)
The goal of this thesis is to create an application that performs real-time fundamental frequency tracking of incoming audio samples. Based on the detected frequencies, the program generates MIDI messages, which are sent to a chosen MIDI device. First, the reader is introduced to the problem of fundamental frequency tracking. The following part describes individual methods, especially the one based on spectral analysis of a tone, together with the technologies used. Next, the implementation and testing of the application are described, including the opinions of several musicians about the product. Finally, the work is summarized and possible further development is outlined.
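The step from a detected frequency to a MIDI message can be illustrated with a minimal sketch. This is not the thesis's implementation: the function names are hypothetical, and the code assumes the standard equal-tempered mapping in which A4 = 440 Hz corresponds to MIDI note 69.

```python
import math


def frequency_to_midi_note(freq_hz: float) -> int:
    """Map a frequency in Hz to the nearest MIDI note number.

    Assumes equal temperament with A4 = 440 Hz = note 69.
    """
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    note = round(69 + 12 * math.log2(freq_hz / 440.0))
    # Clamp to the valid MIDI note range 0..127.
    return max(0, min(127, note))


def note_on_message(note: int, velocity: int = 64, channel: int = 0) -> bytes:
    """Build a raw 3-byte MIDI Note On message (status byte 0x90 | channel)."""
    return bytes([0x90 | (channel & 0x0F), note & 0x7F, velocity & 0x7F])
```

For example, a detected frequency of 440 Hz maps to note 69, and the resulting Note On message could be passed to whatever MIDI output the application uses.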
Determining person's height from spoken utterance
Pelikán, Pavel ; Mekyska, Jiří (referee) ; Atassi, Hicham (advisor)
This diploma thesis focuses on determining a person's height from a spoken utterance. The first part evaluates the current state of the field and refers to published studies, whose findings were used in this thesis. The study with the best results in estimating speaker height was chosen, and its experiment was reproduced in this work. A system for estimating speaker height from the speech signal was created and successfully tested with several acoustic features on spoken utterances from the TIMIT database.
Analysis of prosodic and spectral properties of voice communication in air traffic control
Simonides, Jakub ; Kopřiva, Tomáš (referee) ; Smékal, Zdeněk (advisor)
This thesis analyses the prosodic and spectral features of bi-directional air traffic control communication and describes how the communication was split into segments, according to the source, via transcription. After the splitting, the segments are analyzed in depth for their spectral and prosodic features. The analysis focuses on the spectral aspects of intensity, fundamental frequency F0, slope and centroid. Additionally, tempo and voice activity detection data were measured to support the spectral aspects. Because the spectral aspects of ATC controllers and pilots differ, the direction of the communication can be determined automatically with a relatively high success rate.
Detection of the voice fundamental frequency
Chloupek, Jiří ; Mekyska, Jiří (referee) ; Sysel, Petr (advisor)
This bachelor thesis deals with the detection of the pitch of the human voice. The fundamental frequency is one of the basic parameters of the speech signal in the frequency domain. The thesis describes several methods for pitch detection and the practical application of the correlation method and cepstral analysis.
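As an illustration of the correlation approach mentioned in the abstract, the following is a minimal sketch of autocorrelation-based pitch estimation. It is not taken from the thesis: the function name, the 50 to 400 Hz search range and the absence of any voicing decision are all assumptions made for the example.

```python
import math


def estimate_f0_autocorr(frame, sample_rate, f_min=50.0, f_max=400.0):
    """Estimate the fundamental frequency of one frame via autocorrelation.

    Searches lags corresponding to f_min..f_max (assumed range) and
    returns the frequency of the lag with the highest autocorrelation.
    Returns 0.0 if no positive peak is found.
    """
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    # Remove the DC offset so it does not dominate the correlation.
    mean = sum(frame) / len(frame)
    x = [s - mean for s in frame]
    best_lag, best_value = 0, 0.0
    for lag in range(lag_min, lag_max + 1):
        value = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    if best_lag == 0:
        return 0.0
    return sample_rate / best_lag


# Usage: a synthetic 200 Hz tone sampled at 8 kHz.
tone = [math.sin(2 * math.pi * 200 * n / 8000) for n in range(800)]
f0 = estimate_f0_autocorr(tone, 8000)
```

A practical detector would typically add windowing, a voiced/unvoiced decision and some smoothing across frames; these are omitted here to keep the idea visible.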
Detecting Stress in Speech
Šoltés, Samuel ; Beneš, Karel (referee) ; Grézl, František (advisor)
Stress influences people in several ways and can lead to a decrease in performance and/or critical mistakes. Stress detection in speech measures the influence of stress on speech. The goal of this thesis is to offer a closer look at the impacts of stress, choose adequate speech parameters that would manifest these impacts, implement their estimation and compare the results. The thesis contains a description of stress and its effects on humans; glottal pulse, spectrum, fundamental frequency and formants as the parameters chosen for stress estimation; the design and implementation of parameter value estimation from the speech signal; and the values of these parameters obtained on two different databases.
Web applications supporting education of audio signals generation and processing
Tkadlec, Vojtěch ; Schimmel, Jiří (referee) ; Rajmic, Pavel (advisor)
The main goal of the diploma thesis was the creation of 3 web applications for interactive support of studying courses in the area of digital signal processing using the programming language JavaScript, the markup language HTML, and the Web Audio API interface. The topics included additive synthesis, ADSR time envelope, amplitude modulation, and approximation of 1D signals using discrete cosine transform. The written part of the thesis also focuses on the tools used for the practical part of the thesis. For better understanding of the topics, four web applications were created, and a separate web application was created specifically for the topic of ADSR envelope.
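The ADSR time envelope named among the topics can be sketched as follows. The thesis itself used JavaScript and the Web Audio API; this Python version is only an illustration, and the function name, the linear segment shapes and the parameter names are assumptions rather than the thesis's design.

```python
def adsr_envelope(n_samples, sample_rate, attack, decay, sustain_level, release):
    """Return a piecewise-linear ADSR amplitude envelope as a list of floats.

    attack, decay and release are durations in seconds; sustain_level is
    an amplitude in 0..1. The sustain stage fills whatever length remains.
    """
    a = int(attack * sample_rate)
    d = int(decay * sample_rate)
    r = int(release * sample_rate)
    s = max(0, n_samples - a - d - r)
    env = []
    env += [i / a for i in range(a)]                                # rise 0 -> 1
    env += [1.0 - (1.0 - sustain_level) * i / d for i in range(d)]  # fall 1 -> sustain
    env += [sustain_level] * s                                      # hold
    env += [sustain_level * (1.0 - i / r) for i in range(r)]        # fall sustain -> 0
    return env[:n_samples]


# Usage: a 1-second envelope at 1 kHz with 0.1 s attack, 0.1 s decay,
# sustain at half amplitude and 0.2 s release.
env = adsr_envelope(1000, 1000, 0.1, 0.1, 0.5, 0.2)
```

Multiplying a synthesized tone sample by sample with such an envelope shapes its loudness over time.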
