National Repository of Grey Literature: 32 records found, showing 1 - 10
Implementation of Waveshaper Audio Effect
Leitgeb, David ; Miklánek, Štěpán (referee) ; Schimmel, Jiří (advisor)
The aim of this thesis is the implementation of a non-linear audio effect called a waveshaper. This type of distortion effect contains the following building blocks: a user-defined transfer function, several types of filters and an oversampling processor with multiple stages of oversampling. The first prototype of this audio effect was implemented using Matlab and its Audio Toolbox extension. Due to certain limitations of this prototype, the whole audio effect was later completely rewritten in C++. This new implementation uses the JUCE framework, which is mainly used for audio application development. The transition to this framework allowed real-time editing of the transfer function and a VST3 build of the effect. In addition to a brief introduction to the system types used, the motivation for oversampling and a description of the implementation of both prototypes, this thesis also includes graphical examples demonstrating their correct functionality. Audio files related to these examples are included in the electronic attachment.
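As a rough illustration of the chain this abstract describes (oversample, apply the transfer function, filter, return to the original rate), a minimal Python sketch follows. It is not the thesis code: the tanh transfer function, the linear-interpolation upsampler, the moving-average filter and all function names are assumptions chosen to keep the example dependency-free, and it expects a non-empty input.

```python
import math

def transfer(x, drive=4.0):
    # Hypothetical transfer function: tanh soft clipper scaled so transfer(1) is 1.
    return math.tanh(drive * x) / math.tanh(drive)

def upsample_linear(signal, factor):
    # Crude upsampler: linear interpolation between neighbouring samples
    # (a stand-in for a proper interpolation filter).
    out = []
    for a, b in zip(signal, signal[1:]):
        out.extend(a + (b - a) * k / factor for k in range(factor))
    out.append(signal[-1])
    return out

def moving_average(signal, width):
    # Crude low-pass filter applied before decimation
    # (a stand-in for a proper anti-aliasing filter).
    half = width // 2
    out = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out

def waveshape(signal, factor=4, drive=4.0):
    # Oversample, shape at the higher rate, filter, then keep every factor-th sample.
    up = upsample_linear(signal, factor)
    shaped = [transfer(s, drive) for s in up]
    smoothed = moving_average(shaped, factor)
    return smoothed[::factor]

# Usage: distort 100 samples of a sine wave with amplitude 0.8.
sine = [0.8 * math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
out = waveshape(sine)
```

Shaping at the higher rate gives the harmonics created by the non-linearity more room below the Nyquist frequency, so less of that energy folds back as aliasing after decimation.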
Music information retrieval techniques for determining the place of origin of the Czech chamber and orchestral music interpretations
Miklánek, Štěpán ; Mekyska, Jiří (referee) ; Kiska, Tomáš (advisor)
This diploma thesis is focused on the statistical analysis of recordings of chamber and orchestral classical music by Czech composers. One of the chapters is dedicated to the description of a feature extraction process that precedes the statistical analysis. Techniques of Music Information Retrieval are used during several stages of this thesis. Databases used for analysis are described and pre-processing steps are proposed. A tool for synchronization of the recordings was implemented in MATLAB. Finally, a system for classifying recordings based on their geographical origin is proposed. The recordings are sorted by a binary classifier into two categories: Czech and world recordings. The first part of the statistical analysis is focused on individual analysis of features. The features are evaluated based on their discrimination strength. The second part of the statistical analysis is focused on feature selection, which can improve the overall accuracy of the binary classifier compared to the individual analysis of the features.
Audio signal modelling using neural networks
Pešán, Michele ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
Neural networks based on the WaveNet architecture and networks using recurrent layers are currently used both for human speech synthesis and for "black box" modelling of systems that process acoustic signals, such as modulation effects and non-linear distortion units. The student's task is to summarize current knowledge about the possibilities of using neural networks for modelling acoustic signals. The student will then implement one of the neural network models in the Python programming language and use it to train and subsequently simulate an arbitrary effect or system for acoustic signal processing. As part of the semester project, write the theoretical part of the thesis, create an audio database for training the neural network and implement one of the network structures for audio signal modelling. In recent years, neural networks have been used more and more, across more or less the whole spectrum of scientific disciplines. Neural networks based on the WaveNet architecture and networks using recurrent layers are currently used in a wide range of applications, including human speech synthesis and "black-box" modelling of acoustic systems that modify an audio signal (modulation effects, non-linear distortion units, etc.). This thesis aims to provide an introduction to neural networks and to explain the basic concepts and mechanisms of the field. It also describes the use of neural networks in modelling acoustic systems and applies this knowledge to implement neural networks for modelling an arbitrary effect or device for audio signal processing.
Design and Realization of Acoustic Adjustments of Listening Room
Vajda, Michal ; Miklánek, Štěpán (referee) ; Schimmel, Jiří (advisor)
This bachelor thesis is devoted to the design of acoustic modifications of a residential space and the subsequent implementation of these modifications. Its aim is to design an optimal solution for a music listening space in accordance with the Czech Technical Standards. The basic parameters investigated are reverberation time and acoustic quality criteria. The thesis deals with simulation in the EASE modelling program and addresses the situations before and after acoustic treatment. The following methods and procedures are used to comprehensively address the above-mentioned treatments and to achieve the set objective. Furthermore, the simulation is verified with a custom calculator in Excel. After suitable absorbers are chosen, the treatment is implemented and the final measurement of the reverberation time is carried out. Finally, an additional modification design is created using Helmholtz resonators. The implemented methods achieve an optimal and frequency-balanced reverberation time, and the room can be used as a mixing hall for multi-channel sound recording.
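The abstract does not say which formula the Excel calculator uses. As an assumption, the Python sketch below uses Sabine's equation, RT60 = 0.161 · V / A, where V is the room volume in m³ and A is the total absorption (the sum of each surface area multiplied by its absorption coefficient). The room dimensions and coefficients are invented for illustration only.

```python
def sabine_rt60(volume_m3, surfaces):
    # surfaces: list of (area in m^2, absorption coefficient between 0 and 1).
    absorption = sum(area * alpha for area, alpha in surfaces)
    if absorption <= 0:
        raise ValueError("total absorption must be positive")
    # Sabine's equation (an assumed choice, not confirmed by the abstract).
    return 0.161 * volume_m3 / absorption

# Hypothetical 60 m^3 room before treatment.
untreated = [(52.0, 0.05), (40.0, 0.10)]
# The same room with 8 m^2 of wall covered by a hypothetical absorber.
treated = [(52.0, 0.05), (32.0, 0.10), (8.0, 0.80)]

before = sabine_rt60(60.0, untreated)
after = sabine_rt60(60.0, treated)
```

In practice the coefficients depend on frequency, so a calculation like this would be repeated per octave band to check that the reverberation time is balanced across the spectrum.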
Interactive presentation of audiovisual works in virtual reality
Abdulvaliyev, Chamit ; Miklánek, Štěpán (referee) ; Sikora, Pavel (advisor)
This bachelor thesis is focused on the development of a VR application which allows presentation of audiovisual works in a virtual space. This thesis was created in collaboration with Vasulka Kitchen Brno as part of the project Vasulka Live Archive / Interfaces and uses works of Steina and Woody Vasulka. The application allows playing videos on a virtual canvas, rewinding videos, visualizing predictions generated by a neural network, changing the sorting of the thumbnails and changing the virtual environment. The prediction and sorting data are loaded from JSON files provided by the supervisor of the thesis. The videos are loaded from local storage before being played. Predictions are visualized with generated textures for every tag. The user has access to two environments: outer space and a gallery modelled after the exhibition space of Vasulka Kitchen Brno. In the outer space environment, the user can move using the joysticks on the controllers, which simulates flight with a jet pack. Video rewinding and changing of the video sorting are controlled using the buttons on the controllers. The game engine Unity was used during the development and the application was tested on the VR headset Oculus Quest 2. The text of this bachelor thesis is divided into two parts. The first part describes the basic VR development context and the game engines which can be used: Godot, PlayCanvas and Unity. The second part focuses on the development of the VR application and the results of the bachelor thesis. It details the process of preparing a virtual scene and working with the Unity editor. It also explains the functions of the scripts used and how the problems encountered during development were solved.
Sound records comparison using timbre features
Miklánek, Štěpán ; Schimmel, Jiří (referee) ; Kiska, Tomáš (advisor)
This thesis deals with research into musical features that describe the timbre of music recordings. The first chapter covers the historical development of and modern approaches to the discipline called Music Information Retrieval (MIR), followed by a description of music processing from the perspective of music theory and digital signal processing. Next comes a description of signal pre-processing, a step that is very important when retrieving features from music recordings. The chapter on feature retrieval summarizes the common features used to retrieve information from music recordings, with the main focus on timbral features. A database of music recordings and a feature retrieval system are introduced. The last chapter deals with the individual analysis of timbral features.
Automatic image classification
Ševčík, Zdeněk ; Miklánek, Štěpán (referee) ; Sikora, Pavel (advisor)
The aim of this thesis is to explore clustering algorithms from unsupervised machine learning that can be used to classify an image database by similarity. A theoretical basis is given for the chosen clustering algorithms. To improve classification of the database, the thesis deals with different methods of image preprocessing, which are used to extract features from the images. The thesis then addresses the implementation of the preprocessing methods and the practical application of the clustering algorithms. In the practical part, an application is written in the Python programming language that sorts the image database into classes by similarity. The thesis tests all of the methods used and concludes with a survey of the results.
A tool for simultaneous playback of multiple composition interpretations
Švejcar, Michael ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
The purpose of this Bachelor’s thesis was to create a piece of software which enables the user to simultaneously play back multiple interpretations of a musical piece and switch between them instantaneously. This was achieved using the App Designer in the MATLAB environment, which is intended for developing applications with a graphical user interface. The key to the development of the application was the use of available toolboxes and algorithms for computing chromagrams and multiscale dynamic time warping. The final IntSwitcher player enables the user to load two recordings of interpretations of one song. Chromagrams, which characterize the individual recordings in terms of tonal development over time, are first calculated from the input files. After that, the multiscale dynamic time warping method is applied to the chromagrams, which outputs the warping path. The warping path in this case is a matrix in which musically corresponding samples of the loaded audio files are paired with a resolution of 50 ms. From this, the corresponding time position of the currently inactive track is computed along with its slider position. If the user switches the currently played recording, the second track starts playing in the same part of the composition, even if that part is at a different time in each of the individual recordings. The final software is an appropriate tool for studying differences between various interpretations of the same musical piece.
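The lookup described in this abstract (warping path in, playback position of the other track out) can be sketched in Python as follows. This is only an illustration under stated assumptions: it uses plain dynamic time warping rather than the multiscale variant used in the thesis, toy 3-bin "chroma" frames instead of real 12-bin chromagrams, and hypothetical function names. Only the 50 ms frame resolution comes from the abstract.

```python
def dtw_path(a, b):
    # a, b: non-empty lists of feature frames (each frame is a list of numbers).
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Manhattan distance between the two frames.
            d = sum(abs(x - y) for x, y in zip(a[i - 1], b[j - 1]))
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end to recover the frame pairs on the cheapest path.
    i, j = n, m
    path = [(i - 1, j - 1)]
    while (i, j) != (1, 1):
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
        path.append((i - 1, j - 1))
    path.reverse()
    return path

def corresponding_time(path, t_active, hop=0.05):
    # Map a playback time in the active track to the inactive track.
    frame = min(int(round(t_active / hop)), path[-1][0])
    for i, j in path:
        if i >= frame:
            return j * hop
    return path[-1][1] * hop

# Toy example: track B plays the same three "chords" at half the tempo.
A, B, C = [1, 0, 0], [0, 1, 0], [0, 0, 1]
fast = [A, B, C]
slow = [A, A, B, B, C, C]
path = dtw_path(fast, slow)
t_slow = corresponding_time(path, 0.05)
```

Plain DTW fills a full cost matrix, so its time and memory grow with the product of the two recording lengths; a multiscale approach is one way to keep that manageable for full-length recordings.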
Beat tracking systems for music recordings
Staňková, Karolína ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
This master's thesis deals with systems for detecting rhythmic structures in music recordings. The field of Music Information Retrieval (MIR) allows us to examine the harmonic and tonal properties of music, rhythm, tempo, etc., and has uses in both the academic and commercial spheres. Various algorithms are used in the detection of rhythmic structures; today, however, most new methods use neural networks. This work aims to summarize the current research results of systems for detecting beats and tempo, to describe methods of calculating and evaluating the parameters of music recordings, and to implement a program that allows comparison of available detection systems. The result of the work is a script in the Python language, which uses six different systems to detect the rhythmic structure of test recordings. It then checks the outputs of the algorithms against the given reference and compares the systems with each other using several evaluation values. It uses two datasets as a reference: one is publicly available and the other was created by the author of this thesis (including annotations, i.e., reference beat times, for the sample recordings). The program allows the user to see the results in graphs and play any of the sample recordings with detected beat times.
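The abstract does not name the evaluation values used. One measure commonly applied to beat trackers is the F-measure with a tolerance window around each reference beat. The Python sketch below assumes a ±70 ms window and a simple greedy matching, with hypothetical beat times; it is an illustration, not the thesis script.

```python
def beat_f_measure(reference, estimated, tolerance=0.07):
    # reference, estimated: beat times in seconds.
    used = set()
    matched = 0
    for r in reference:
        best = None
        for idx, e in enumerate(estimated):
            if idx in used or abs(e - r) > tolerance:
                continue
            # Keep the closest unused estimate inside the tolerance window.
            if best is None or abs(e - r) < abs(estimated[best] - r):
                best = idx
        if best is not None:
            used.add(best)
            matched += 1
    if matched == 0:
        return 0.0
    precision = matched / len(estimated)
    recall = matched / len(reference)
    return 2 * precision * recall / (precision + recall)

# Hypothetical annotation and detector output: three of four beats are hit.
reference_beats = [0.5, 1.0, 1.5, 2.0]
detected_beats = [0.52, 1.01, 1.7, 2.03]
score = beat_f_measure(reference_beats, detected_beats)
```

Greedy matching is enough when beats are spaced much further apart than the tolerance; evaluation libraries may use an optimal one-to-one matching instead, which can give slightly different scores for dense or noisy detections.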
Extraction of parameters for the research of music performance
Laborová, Anna ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
Different music performances of the same piece may differ significantly from each other. The listener’s musical experience is defined not only by the composer and the score; the performance itself is an integral part of this experience. Four parameter classes can be used to describe a performance objectively: tempo and timing, loudness (dynamics), timbre, and pitch. Each of the individual parameters or their combination can generate a unique characteristic performance. The extraction of such objective parameters is one of the difficulties in the fields of Music Performance Analysis and Music Information Retrieval. The submitted work summarizes knowledge and methods from both fields. The system is applied to extract data from 31 performances of the second movement (Lento) of String Quartet No. 12 in F major (1893) by the Czech Romantic composer Antonín Dvořák (1841–1904).
