|
Research of dynamics features comparing audio records
Zemánková, Šárka ; Smékal, Zdeněk (referee) ; Kiska, Tomáš (advisor)
This work deals with the analysis of parameters related to the dynamics of sound recordings. It contains a brief description of the history of sound processing in analogue and digital form and the process of audio signal processing nowadays. The following chapter includes selection of the most suitable parameters for describing an audio recording, especially those describing the dynamics. This work further characterizes the methods used in similar researches in the world. There is also a system designed to calculate 43 dynamic parameters and the possibilities of their analysis are outlined as well. 35 different interpretations of one musical work were compared. Finally, the calculated parameters were drawn into scatter plots and evaluated using visual cluster analysis.
|
|
Sound records comparison using timbre features
Miklánek, Štěpán ; Schimmel, Jiří (referee) ; Kiska, Tomáš (advisor)
This thesis deals with research of musical features, which are describing music recordings relating to timbre. First chapter deals with historical development and modern approach in a discipline called Music Information Retrieval (MIR), further there is a description of music processing from the perspective of music theory and digital signal processing. Then followed by a description of signal pre-processing. This part is very important when retrieving features from music recordings. In chapter concerned about retrieving features there are summarized all common features used when retrieving information from musical recordings with main concern to timbral features. A database of music recordings and a feature retrieving system is introduced. The last chapter deals with individual analysis of timbral features.
|
| |
|
Reconstruction of signal modified by fade-in/fade-out
Bača, Petr ; Kiska, Tomáš (referee) ; Rajmic, Pavel (advisor)
This thesis contains the theory needed to solve the special problem of bit-depth expansion. The goal is to reconstruct the signal which suffered from application of the fade-in, fade-out effect. The theory includes information of analog to digital conversion and the theory of sparse representations. Thesis formulates the task of bit-depth expansion and advices the algorithm to solve it. Furthermore, the realization of the issue is discussed and the results are given.
|
|
An alternative JPEG coder/decoder
Jirák, Jakub ; Kiska, Tomáš (referee) ; Rajmic, Pavel (advisor)
The JPEG codec is currently the most widely used image format. This work deals with the design and implementation of an alternative JPEG codec using proximal algorithms in combination with the fixation of points from the original image to suppression of artifacts created in common JPEG coding. To solve the problem, the prox_TV and then the Douglas-Rachford algorithm were used, for which special functions using l_1-norm for image reconstruction were derived. The results of the proposed solution are very good because they can effectively suppress the artefacts created and the result corresponds to the image with a higher set qualitative factor. The proposed method achieves very good results for both simple images and photos, but in the case of large images (1024 × 1024 px) and larger, a large amount of computing time is required, so the method is more suitable for smaller images.
|
|
Application for the calculation of speech features describing hypokinetic dysarthria
Hynšt, Miroslav ; Mekyska, Jiří (referee) ; Kiska, Tomáš (advisor)
This thesis is about design and implementation of application for computing speech parameters on people with Parkinson disease. At the beginning is generaly described Parkinson disease and Hypokinetic dysarthria and how it affects the speech and speech parameters when it occurs. Mainly there are described areas of speech like phonation, prosody, articulation and fluent speech. As a part of next topic this thesis describes specific speech parameters with bigger meaning during diagnosis Parkinson disease and it's progress over the time. There are also mentioned few significant studies dealing with examination of speech of the subjects with diagnoses of Parkinson disease and computing some speech parameters in order to analyze their speech impairments. Part of the thesis is description of implemented standalone application for calculating, exporting and visualizing of speech parameters from selected sound records.
|
| |
|
Platform for subjective evaluation of video-sequences
Srnec, Tomáš ; Kiska, Tomáš (referee) ; Číka, Petr (advisor)
This bachelor thesis is focused on subjective video quality assessment. Used modern codecs such as H.264, H.265, VP8 and VP9 are described in first chapter. In the next part of the thesis, four methods of the subjective video assessment are being called, according to Recommendation ITU-T P.910. The practical part includes encoding of three videos, into four resolutions, for four codecs. Output of the thesis is JavaFX application, capable of playing used videos for participants of test, who are making judgment. Their results are real-time sent to MySQL server and directly in application evaluated into bar charts. According to our results, the best codec is VP9, before codec H.265, H.264 and VP8.
|
|
Acoustic analysis of gender-related patterns in Parkinson's disease
Herinek, Denis ; Kiska, Tomáš (referee) ; Galáž, Zoltán (advisor)
The bachelor's thesis is about acoustic analysis of gender-related patterns in Parkinson's disease by analysing speech task: reading passage. Parkinson's disease manifests in all subsystems involved in speech production (respiration, phonation, articulation and prosody). The aim of this thesis is familirization with symptoms of this disorder and speech parameters influenced by this disorder. Thesis contains preprocessing, parametrization of speech signal and statistic analysis of parameters. System of speech signal processing is created in MATLAB programming language.
|
|
De-identification of speakers with hypokinetic dysarthria
Kárník, Radoslav ; Kiska, Tomáš (referee) ; Mekyska, Jiří (advisor)
This paper discuses design and implementation of a system that performs de-identification of speech recordings of patients suffering from Parkinson's disease. The paper describes causes and symptoms of Parkinson's disease and effects of hypokinetic dysarthria on speech. Part of the paper is devoted to speech features that can be used for diagnosing hypokinetic dysarthria from speech. It also describes ways of speech de-identification and system for evaluating results using recognition of speakers and patients. De-identification system uses vocal tract length normalization (VTLN) and evaluating system uses Gaussian mixture models (GMM). PARCZ database was used for testing. It contains recordings of speech of patients affected by Parkinson's disease and control speakers.
|