National Repository of Grey Literature 36 records found  beginprevious23 - 32next  jump to record: Search took 0.01 seconds. 
Exploring the Possibilities of Automated Annotation of Classical Music with Abrupt Tempo Changes
Ištvánek, Matěj ; Miklánek, Štěpán
In this paper, we introduce options for automatic measure detection based on synchronization, beat detection, and downbeat detection strategy. We evaluate proposed methods on two motifs from the dataset of Leos Janacek's string quartet music. We use specific user-driven metrics to capture annotation efficiency simulating a scenario where a musicologist has to use the output of an automated system to create ground-truth annotations on given recordings. In the case of the first motif, synchronization outperformed other methods by detecting most of the measure positions correctly. This procedure was also the most suitable for the second motif—it achieved a low number of correct detections, but the vast majority of transferred time positions belonged within the outer tolerance window. Therefore, in most cases, only shifting operations were needed resulting in higher annotation efficiency. Results suggest that the state-of-the-art downbeat tracking is not yet efficient for expressive music.
Implementation of Waveshaper Audio Effect
Leitgeb, David ; Miklánek, Štěpán (referee) ; Schimmel, Jiří (advisor)
The aim of this thesis is the implementation of a non-linear audio effect called waveshaper. This type of distortion effect contains the following building blocks: user defined transfer function, several types of filters and an oversampling processor with multiple stages of oversampling. The first prototype of this audio effect was implemented using Matlab and its Audio Toolbox extension. Due to certain limitations of this prototype, the whole audio effect was later completely rewritten in C++. This new implementation uses the JUCE framework which is mainly used for audio application development. The transition to this framework allowed real time editing of the transfer function and a VST3 build of the effect. In addition to a brief introduction of the used system types, motivation for oversampling and the description of the implementation for both prototypes, this thesis also includes graphical examples demonstrating their correct functionality. Audio files related to these examples are included in the electronic attachment.
Interactive presentation of audiovisual works in virtual reality
Abdulvaliyev, Chamit ; Miklánek, Štěpán (referee) ; Sikora, Pavel (advisor)
This bachelor thesis is focused on development of a VR application which allows presentation of audiovisual works in a virtual space. This thesis was created in collaboration with Vasulka Kitchen Brno as part of the project Vasulka Live Archive / Interfaces and uses works of Steina and Woody Vasulkas. The application allows playing videos on a virtual canvas, rewinding videos, visualizing predictions generated by a neural network, changing the sorting of the thumbnails and changing the virtual environment. The prediction and sorting data are loaded from JSON files provided by the supervisor of the thesis. The videos are loaded from local storage before being played. Predictions are visualized with generated textures for every tag. The user has access to two environments, outer space and gallery, which was modelled after the exhibition space Vasulka Kitchen Brno. In the outer space environment, the user can move using the joysticks on the controllers, which simulates flight with a jet pack. Video rewinding and changing of the video sorting is controllable using the buttons on the controllers. The game engine Unity was used during the development and the application was tested on the VR headset Oculus Quest 2. The text of this bachelor thesis is divided in two parts. In the first part, the basic VR development context and game engines which can be used are described. These game engines are described: Godot, PlayCanvas and Unity. The second part focuses on the development of the VR application and the results of the bachelor thesis. It details the process of preparing a virtual scene and working with Unity editor. Also it explains the functions of the used scripts and ways of solving the problems met during the development.
Design and Realization of Acoustic Adjustments of Listening Room
Vajda, Michal ; Miklánek, Štěpán (referee) ; Schimmel, Jiří (advisor)
This bachelor thesis is devoted to the design of acoustic modifications of residential space and the subsequent implementation of these modifications. Its aim is to design an optimal solution of the space for listening to music in accordance with the Czech Technical Standards. The basic parameters investigated are reverberation time and acoustic quality criteria. The thesis deals with simulation in the EASE modelling program and addresses situations before and after acoustic treatment. The following methods and procedures are used to comprehensively address the above mentioned treatments and to achieve the set objective. Furthermore, the simulation is verified with a custom calculator in Excel. After using the absorbers suitable for adjustment, the implementation with the final measurement of the reverberation time is carried out. Finally, an additional modification design is created using Helmholtz resonators. The implemented methods achieve an optimal and frequency balanced reverberation time and the room can be used as a mixing hall for multi-channel sound recording.
Web interface for audio feature visualization
Putz, Viliam ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
This thesis deals with methods of audio features extraction from audio files, visualization of these features and implementation of web interface, which provides the visualization. In the introduction, Music Information Retrieval field, with which this thesis is closely related, is described. Also, the current state in the area of applications for audio features extraction is described. Next, the most common libraries for audio feature extraction within the programming languages are listed. In the second chapter, the audio features that can be extracted from audio file are listed and described. In the third chapter, there is description of the process of implementation, used technologies, function diagram of the web interface, explanation of functionality and description of user interface and its functions.
System for finding duplicate recordings based on audio information
Švejcar, Michael ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
This diploma thesis discusses different methods of detecting duplicates in a music file database. The problem at hand is that files containing the same recording may differ in sound quality, applause at the end of a performance and other such parameters. The aim of this thesis is to design and implement a system that identifies duplicate recordings and provides an output file for the comparison. The system needs to not be affected by the mentioned parameters but precise enough to prevent matching non-identical recordings. The system is realized using the Python programming language, freely available libraries for computing chroma features, Image Hashing technique and multiple variants of the dynamic time warping algorithm. Three comparison methods were implemented in the system, differing in precision and computation complexity. The methods were then tested on a prepared dataset and four preset precision options were created. The final system seems very precise and insusceptible to detecting recordings that are very similar but not identical as duplicates, for example in case of different interpretations of the same musical piece.
Neural Networks With Dilated Convolutions For Sound Event Recognition
Miklanek, Stepan
Convolutional neural networks, most commonly deployed in image classification tasks,typically use square-shaped convolutional kernels, which are well suited for feature extraction fromtwo-dimensional data. This study explores the effect of utilizing spectrally aware dilated convolutionsspecialized for sound event recognition. By extending the base kernels in the time or the frequencydimension, the features extracted from the spectral audio representations should, in theory, bettercapture the temporal and timbral information of different sound events. The baseline neural networkmodel with squared kernels was compared against three models, which used an increasing dilationfactor in the subsequent convolutional layers. The three models were purposefully tuned to focustowards the frequency and time feature extraction. The results have shown that the models withdilated convolutions performed noticeably better in comparison with the baseline model.
Extraction of parameters for the research of music performance
Laborová, Anna ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
Different music performances of the same piece may significantly differ from each other. Not only the composer and the score defines the listener’s music experience, but the music performance itself is an integral part of this experience. Four parameter classes can be used to describe a performance objectively: tempo and timing, loudness (dynamics), timbre, and pitch. Each of the individual parameters or their combination can generate a unique characteristic performance. The extraction of such objective parameters is one of the difficulties in the field of Music Performance Analysis and Music Information Retrieval. The submitted work summarizes knowledge and methods from both of the fields. The system is applied to extract data from 31 string quartet performances of 2. movement Lento of String Quartet no. 12 F major (1893) by czech romantic composer Antonín Dvořák (1841–1904).
Beat tracking systems for music recordings
Staňková, Karolína ; Miklánek, Štěpán (referee) ; Ištvánek, Matěj (advisor)
This master thesis deals with systems for detecting rhythmic structures of music recordings. The field of Music Information Retrieval (MIR) allows us to examine the harmonic and tonal properties of music, rhythm, tempo, etc., and has uses in academic and commercial sphere. Various algorithms are used in the detection of rhythmic structures. However, today, most new methods use neural networks. This work aims to summarize the current research results of systems for detecting beats and tempo, to describe methods of calculating and evaluating the parameters of music recordings, and to implement a program that allows comparison of available detection systems. The result of the work is a script in the Python language, which uses six different systems to detect the rhythmic structure of test recordings. It then checks the outputs of the algorithms according to the given reference and compares the given systems with each other using several evaluation values. It uses two datasets as a reference—one of them is publicly available and the other was created by the author of this thesis (including annotations, i.e., reference beat times, for the sample recordings). The program allows user to see the results in graphs and play any of the sample recordings with detected beat times.
Audio signal modelling using neural networks
Pešán, Michele ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
Neuronové sítě vycházející z architektury WaveNet a sítě využívající rekurentní vrstvy jsou v současnosti používány jak pro syntézu lidské řeči, tak pro „black box“ modelování systémů pro úpravu akustického signálu – modulační efekty, nelineární zkreslovače apod. Úkolem studenta bude shrnout dosavadní poznatky o možnostech využití neuronových sítí při modelování akustických signálů. Student dále implementuje některý z modelů neuronových sítí v programovacím jazyce Python a využije jej pro natrénování a následnou simulaci libovolného efektu nebo systému pro úpravu akustického signálu. V rámci semestrální práce vypracujte teoretickou část práce, vytvořte zvukovou databázi pro trénování neuronové sítě a implementujte jednu ze struktur sítí pro modelování zvukového signálu. Neuronové sítě jsou v průběhu posledních let používány stále více, a to víceméně přes celé spektrum vědních oborů. Neuronové sítě založené na architektuře WaveNet a sítě využívající rekurentních vrstev se v současné době používají v celé řadě využití, zahrnující například syntézu lidské řeči, nebo napřklad při metodě "black-box" modelování akustických systémů, které upravují zvukový signál (modulačí efekty, nelineární zkreslovače, apod.). Tato akademická práce si dává za cíl poskytnout úvod do problematiky neuronových sítí, vysvětlit základní pojmy a mechanismy této problematiky. Popsat využití neuronových sítí v modelování akustických systémů a využít těchto poznatků k implementaci neuronových sítí za cílem modelování libovolného efektu nebo zařízení pro úpravu zvukového signálu.

National Repository of Grey Literature : 36 records found   beginprevious23 - 32next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.