National Repository of Grey Literature 2 records found  Search took 0.02 seconds. 
Cover Song Identification using Music Harmony Features, Model and Complexity Analysis
Maršík, Ladislav ; Pokorný, Jaroslav (advisor) ; Ge, Mouzhi (referee) ; Łukasik, Ewa (referee)
Title: Cover Song Identification using Music Harmony Features, Model and Complexity Analysis Author: Ladislav Maršík Department: Department of Software Engineering Supervisor: Prof. RNDr. Jaroslav Pokorný, CSc., Department of Software Engineering Abstract: Analysis of digital music and its retrieval based on the audio fe- atures is one of the popular topics within the music information retrieval (MIR) field. Every musical piece has its characteristic harmony structure, but harmony analysis is seldom used for retrieval. Retrieval systems that do not focus on similarities in harmony progressions may consider two versions of the same song different, even though they differ only in instrumentation or a singing voice. This thesis takes various paths in exploring, how music harmony can be used in MIR, and in particular, the cover song identification (CSI) task. We first create a music harmony model based on the knowledge of music theory. We define novel concepts: a harmonic complexity of a mu- sical piece, as well as the chord and chroma distance features. We show how these concepts can be used for retrieval, complexity analysis, and how they compare with the state-of-the-art of music harmony modeling. An extensive comparison of harmony features is then performed, using both the novel fe- atures and the...
Feature Evaluation for Scalable Cover Song Identification Using Machine Learning
Martišek, Petr ; Maršík, Ladislav (advisor) ; Hajič, Jan (referee)
Cover song identification is a field of music information retrieval where the task is to determine whether two different audio tracks represent different versions of the same underlying song. Since covers might differ in tempo, key, instrumentation and other characteristics, many clever features have been developed over the years. We perform a rigorous analysis of 32 features used in related works while distinguishing between exact and scalable features. The former are based on a harmonic descriptor time series (typically chroma vectors) and offer better performance at the cost of computation time. The latter have a small constant size and only capture global phenomena in the track, making them fast to compute and suitable for use with large datasets. We then select 7 scalable and 3 exact features to build our own two-level system, with the scalable features used on the first level to prune the dataset and the exact on the second level to refine the results. Two distinct machine learning models are used to combine the scalable resp. exact features. We perform the analysis and the evaluation of our system on the Million Song Dataset. The experiments show the exact features being outperformed by the scalable ones, which lead us to a decision to only use the 7 scalable features in our system. The...

Interested in being notified about new results for this query?
Subscribe to the RSS feed.