National Repository of Grey Literature
Vizualization of Outputs from Speech Technologies for Contact Centers
Zhezhela, Oleksandr ; Szőke, Igor (referee) ; Schwarz, Petr (advisor)
The thesis focuses on the visualisation of data mined by speech processing technologies. Several methods of speech data extraction were studied, and the technologies available for this task were analysed. The variety of metadata that can be mined from speech was defined. Existing standards and processes of call centres were also examined. Requirements for the user interface were gathered and analysed. On that basis, and after consultation with call centre employees, a concept for speech data visualisation was defined and implemented. The resulting solutions were integrated into the Speech Analytics Server (SPAS).
A tool for simultaneous playback of multiple composition interpretations
Švejcar, Michael ; Ištvánek, Matěj (referee) ; Miklánek, Štěpán (advisor)
The purpose of this Bachelor’s thesis was to create a piece of software that enables the user to simultaneously play back multiple interpretations of a musical piece and switch between them instantaneously. This was achieved using the App Designer in the MATLAB environment, which is intended for developing applications with a graphical user interface. The key to developing the application was the use of available toolboxes and algorithms for computing chromagrams and multiscale dynamic time warping. The final IntSwitcher player enables the user to load two recordings of interpretations of one song. Chromagrams, which characterize the individual recordings in terms of tonal development over time, are first calculated from the input files. After that, the multiscale dynamic time warping method is applied to the chromagrams, which outputs the warping path. The warping path in this case is a matrix in which musically corresponding samples of the loaded audio files are paired with a resolution of 50 ms. From this, the corresponding time position of the currently inactive track is computed along with its slider position. If the user switches the currently played recording, the second track starts playing at the same part of the composition, even if that part occurs at a different time in each of the individual recordings. The final software is an appropriate tool for studying differences between various interpretations of the same musical piece.
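The position lookup described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: it is written in Python rather than MATLAB, it uses plain dynamic time warping instead of the multiscale variant, and it works on hand-made feature vectors instead of real chromagrams. The names `dtw_path`, `map_position`, and `FRAME_SEC` are hypothetical and chosen only for this example; the 50 ms frame length is the one value taken from the abstract.

```python
import math

FRAME_SEC = 0.05  # 50 ms resolution, as stated in the abstract


def dtw_path(a, b):
    """Plain DTW (not multiscale) between two sequences of feature vectors.

    Returns the warping path as a list of (i, j) frame-index pairs.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = cheapest alignment of a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1],
                                 cost[i - 1][j],
                                 cost[i][j - 1])
    # Backtrack from the end of both sequences to the start.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min((cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1))
    path.reverse()
    return path


def map_position(path, t_active, active_is_first=True):
    """Map a playback time in the active track to the inactive track."""
    frame = round(t_active / FRAME_SEC)
    for i, j in path:
        src, dst = (i, j) if active_is_first else (j, i)
        if src >= frame:
            return dst * FRAME_SEC
    # Past the end of the path: clamp to the last aligned frame.
    i, j = path[-1]
    return (j if active_is_first else i) * FRAME_SEC


# Toy "chromagrams": one-hot vectors standing in for three pitch classes.
C, D, E = (1, 0, 0), (0, 1, 0), (0, 0, 1)
take_a = [C, C, D, E]      # holds the first note longer
take_b = [C, D, D, D, E]   # holds the second note longer
path = dtw_path(take_a, take_b)
```

In this toy case, switching from `take_a` to `take_b` at 0.10 s (where the second note starts in `take_a`) calls `map_position(path, 0.10)`, which returns the time at which the second note starts in `take_b`. A real player would presumably compute the path once after loading and only do the lookup at switch time.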
