National Repository of Grey Literature 17 records found  1 - 10next  jump to record: Search took 0.00 seconds. 
Voice-controlled app for capturing photos and clips
Jurčík, Adam Dalibor ; Španěl, Michal (referee) ; Herout, Adam (advisor)
The goal of this work is to create an application that solves the problems of video and photo recording while the same person is playing sports. These problems are, for example, the need to carry a remote control or the constant interruption of the activity due to manual activation. This is solvedby using voice control and keywords to trigger the taking of a photo. This paper discusses in detail how voice control can be implemented on Android operating system in Java programming language. It also includes an explanation of how to store these files and how to control hardware devices such as the camera. The thesis includes a more detailed description of two approaches to voice recognition and describes their functionality in different applications.
Controlling Mobile App by Voice
Cologna, Adam ; Szőke, Igor (referee) ; Herout, Adam (advisor)
The aim of this bachelor's thesis is to explore, analyze, and compare available libraries for controlling mobile applications with voice commands. According to the thesis requirements, I solved the problem primarily for the Android operating system. I have considered not only libraries for keyword detection but also those using the speech recognition technology. For the selected libraries, I conducted accuracy testing for voice command detection and performance testing. To facilitate the integration of the chosen libraries, an application was developed in the modern programming language Kotlin using Jetpack Compose. The main contribution of this work is the experiments carried out and the resulting recommendations regarding the examined libraries. Among the most effective solutions were those from Microsoft Azure and the Android Speech Recognizer class. The main obstacles for each library were the distance between the speaker and the microphone, as well as distracting background music.
Search in speech recordings based on semantic vectors
Boboš, Dominik ; Karafiát, Martin (referee) ; Schwarz, Petr (advisor)
V současné době přetížené informacemi jsou efektivní metody vyhledávání informací velice žádané. Tato práce shrnuje metody pro získávání vektorových reprezentací pro text a zvuk, známé také jako sémantické vektory. Podívali jsme se hlouběji na multimodální mo\-de\-ly, jako jsou SpeechT5 a SeamlessM4T, které transformují tyto typy vstupu do jednoho sdíleného vektorového prostoru. Na základě těchto modelů jsme vybudovali systém, který nám umožňuje vyhledávat v datech bez ohledu na modalitu. Abychom mohli vyhodnotit navrhované řešení, kromě standardního rozpoznávání klíčových slov, také pro úlohy sémantického vyhledávání, manuálně jsme označili datovou sadu pro zachycení podobných sémantických významů klíčových slov nebo frází. Nakonec jsme provedli několik experimentů, kde jsme prozkoumali možnosti modelů omezením pozorovaného kontextu během dotrénovaní neuronové sítě nebo zapojením systémů převodu textu na řeč (TTS) ke zlepšení celkového výkonu.
Vizualization of Outputs from Speech Technologies for Contact Centers
Zhezhela, Oleksandr ; Szőke, Igor (referee) ; Schwarz, Petr (advisor)
The thesis is aimed on visualisation of data mined by speech processing technologies. Some methods speech data extraction were studied and technologies for this task were analysed. The variety of meta data that can be mined from speech was defined. Were also examined existing standards and processes of call centres. Some requirements for the user interface were gathered and analysed. On that basis and after communication with call centre employees there was defined and implemented a concept for speech data visualization. Gained solutions were integrated into Speech Analytics Server (SPAS).
Query-by-Example Keyword Spotting
Skácel, Miroslav ; Hannemann, Mirko (referee) ; Szőke, Igor (advisor)
Tato diplomová práce se zabývá moderními přístupy detekce klíčových slov a detekce frází v řečových datech. V úvodní části je seznámení s problematikou a teoretický popis metod pro detekci. Následuje popis reprezentace vstupních datových sad použitých při experimentech a evaluaci. Dále jsou uvedeny metody pro detekci klíčových slov definovaných vzorem. Následně jsou popsány evaluační metody a techniky použité pro skórování. Po provedení experimentů na datových sadách a po evaluaci jsou diskutovány výsledky. V dalším kroku jsou navrženy a poté implementovány moderní postupy vedoucí k vylepšení systému pro detekci a opět je provedena evaluace a diskuze dosažených výsledků. V závěrečné části je práce zhodnocena a jsou zde navrženy další směy vývoje našeho systému. Příloha obsahuje manuál pro používání implementovaných skriptů.
Keyword Spotting Implementation to Mobil Phone (Symbian 60)
Cipr, Tomáš ; Schwarz, Petr (referee) ; Szőke, Igor (advisor)
Keyword spotting is one of the many applications of automatic speech recognition. Its purpose is determining spots in given utterance in which some of the specified words were spoken. Keyword spotting has a great potential to enhance performance of new applications as well as the existing ones. An example could be a mobile phone voice control. Due to OS Symbian's coming to the market it is even possible for end user to implement a keyword spotting for a mobile phone on his or her own. The thesis describes theoretical prerequisites for keyword spotting and its implementation. Firstly the OS Symbian is presented with respect to the given task. Secondly each step of keyword spotting process is described. Finally the object design of keyword spotter is presented followed by implementation description. The thesis concludes with results review and notes on possible improvements.
Multimedia Browser for Lectures
Kubalík, Jakub ; Potúček, Igor (referee) ; Schwarz, Petr (advisor)
In preamble reader will understand the purpose of this project and why it was initiated. It's large team project, so description of its structure represents main part of this report. A core of the browser contain a lot of techniques, e.g. system of components, special internal communication and other. The main goals of this system are portability and reusability - there is exact specification of syntax rules and used data formats. Non-interchangeable role in project has format XML. Invention of new components was the last intervention in project, with intention to use of the project in future and involve in complex search engine - the engine discuss final part of report.
Electronic Flight Bag
Kúšik, Lukáš ; Vlk, Jan (referee) ; Chudý, Peter (advisor)
Cieľom tejto diplomovej práce je vytvoriť Electronic Flight Bag (EFB) aplikáciu pre mobilné telefóny s operačným systémom Android. Pre splnenie tejto úlohy bola preskúmaná aktuálna legislatíva ohľadom EFB aplikácií spolu s najmodernejšími EFB aplikáciami dostupnými na aplikačnom trhu. Na základe týchto informácií je navrhnutá a implementovaná EFB aplikácia určená pre pilotov všeobecného letectva. Výsledný produkt obsahuje funkcie pre plánovanie letu, vlastnú leteckú mapu, pilotný denník, katalóg letísk s dátami z celého sveta a ďalšie. Podpora offline zaručuje funkčnosť v reálnych podmienkach letu. Konečný produkt sa taktiež snaží inovovať nad existujúcimi EFB aplikáciami zahrnutím funkcionalít, akými sú napríklad automatické kontrolné zoznamy a náhľad v rozšírenej realite.
Keyword spotting
Zemánek, Tomáš ; Ošmera, Pavel (referee) ; Pfeifer, Václav (advisor)
This thesis is aimed on design keyword detector. The work contains a description of the methods that are used for these purposes and design of algorithm for keyword detection. The proposed detector is based on the method of DTW (Dynamic Time Warping). Analysis of the problem was performed on the module programmed in ANSI C, which was created within the thesis. The results of the detector were evaluated using the metrics WER (word error rate) and AUC (area under curve).
Optimization of Voice Recognition for Mobile Devices
Tomec, Martin ; Zbořil, František (referee) ; Hanáček, Petr (advisor)
This work deals with optimization of keyword spotting algorithms   on processor architecture ARM Cortex-A8. At first it describes this    architecture and especially the NEON unit for vector computing.   In addition it briefly describes keyword spotting algorithms and also there is proposed optimization of these algorithms for described architecture. Main part of this work is implementation of these optimizations and analysis of their impact on performance.

National Repository of Grey Literature : 17 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.