National Repository of Grey Literature: 134 records found (records 11 - 20)
Using of voice for home equipment control
Bubla, Lukáš ; Vojáček, Zdeněk (referee) ; Lacko, Branislav (advisor)
This bachelor thesis focuses on home automation, specifically on voice control. It describes methods of communicating with and controlling automated systems, and in particular the problems of computer voice recognition using different methods. The thesis presents a specific voice control product developed by the Czech company Elko ep, s.r.o. and then briefly looks at gesture control. Finally, it compares the Elko ep product with products from foreign companies and discusses the advantages and disadvantages of voice and gesture control.
Speech Recognition For Selected Languages
Schmitt, Jan ; Karafiát, Martin (referee) ; Janda, Miloš (advisor)
This bachelor's thesis deals with recognition of continuous speech for three languages: Bulgarian, Croatian and Swedish. It describes the basics of speech processing and recognition methods such as acoustic modeling using hidden Markov models and Gaussian mixture models. Another aim of the work is to prepare data for these languages from the GlobalPhone database so that they can be used with the speech recognition toolkits Kaldi and HTK. With the data prepared, several models are trained and tested using the Kaldi toolkit.
Voice recognition of standard PILOT-CONTROLLER control commands
Kufa, Tomáš ; Polách, Petr (referee) ; Honzík, Petr (advisor)
The subject of this diploma thesis is the application of speech recognition to ATC commands. The selection of methods and approaches for automatic recognition of ATC commands is based on a detailed study of air traffic. Because there is no single definitive solution in a field as extensive as speech recognition, the thesis focuses on a speech recognizer based on comparison with templates (DTW). This recognizer is implemented in the thesis and compared with the freely accessible HTK system from Cambridge University, which is based on statistical methods using hidden Markov models. The suitability of both methods is verified by practical testing and evaluation of the results.
Character recognition in the soundtrack with SOM
Malásek, Jan ; Honzík, Petr (referee) ; Pohl, Jan (advisor)
This bachelor's thesis describes the history of the evolution of neural networks and their use in speech recognition systems, and shows the problems involved in operating and training neural networks. It presents three selected speech recognition systems, including their evaluation in experiments and their advantages and disadvantages. It also covers the characteristics of human speech and systems for recognizing it. The last part focuses on the frequency spectra of different types of vowels and gives instructions for programming neural networks in MATLAB.
Speech Recognition (digit)
Kantar, Martin ; Minář, Petr (referee) ; Matoušek, Radomil (advisor)
The aim of this diploma thesis is to explain what speech is and what its constituents are. I describe commonly used methods for preparing signals for recognition. Schematic examples show the principles of current speech recognizers, along with their advantages and disadvantages. I implemented a speech recognition program for the numerals 0-9 in MATLAB using neural network learning.
Recognition of Multi-Talker Overlapping Speech Using Neural Networks
Hradil, Jaromír ; Švec, Ján (referee) ; Žmolíková, Kateřina (advisor)
This thesis deals with recognition of overlapping speech from multiple speakers using neural networks. It examines the problem of recognizing speech from multiple speakers and the ways in which this problem is addressed. Specifically, alongside traditional components such as convolutional neural networks and LSTMs, it applies special components: the attention mechanism and gated convolutions. It also applies a technique called permutation invariant training. Part of the work is the application of these approaches to the assigned training data, which consist of artificially created mixtures of two speakers reading articles from the Wall Street Journal. The next step was training the respective architectures, which combine the elements mentioned above. The models in this work replace the acoustic model. There were two architectures using different types of attention mechanism and one without it. The experiments showed that, for this type of task, the architectures using the attention mechanism did not outperform the more traditional architecture using gated convolutions. Nevertheless, they showed potential.
Voice Controlled Calculator
Pavelek, Ota ; Szőke, Igor (referee) ; Grézl, František (advisor)
This thesis describes the implementation of a calculator that can be controlled both by voice and in the usual way. Speech recognition is implemented with the BSCORE library, and its recognition network is restricted to the words the calculator needs. Once recognition is done, the recognized sentence is transformed into an expression and shown to the user so that it can be corrected (especially in the case of erroneous recognition). The expression is evaluated at the user's request. The purpose of voice control is to make the calculator more effective to use and accessible to handicapped users.
Detection of speech disorders
Struhař, Michal ; Rajmic, Pavel (referee) ; Sysel, Petr (advisor)
This thesis deals with the detection of speech disorders. One of its aims is to choose a suitable parameterization: short-time energy, zero-crossing rate, linear predictive analysis, perceptual linear predictive analysis, the RASTA method, cepstral analysis and mel-frequency cepstral coefficients can be chosen for detection. The next aim is the construction of a speech disorder detector based on DTW (Dynamic Time Warping) and an artificial neural network. Each detection is carried out on the basis of tokens collected from the chosen analysis and a phonetic transcription of the speech. The analyses, the detector and the phonetic transcription of the Czech language are implemented in the MATLAB simulation environment.
Voice Dialog System in Web Browser for Demonstration Purposes
Vlček, Pavol ; Glembek, Ondřej (referee) ; Schwarz, Petr (advisor)
The aim of the thesis is to design and build a voice-controlled assistant (voicebot) that can be easily deployed on a web page. Users are thus given a modern way to communicate naturally through a web browser. The main emphasis is on synchronization between the voice assistant and the content of the web page. Synchronization is achieved by bidirectional transfer of voice and text commands between the client and the server. For this, WebRTC technology is used in combination with the SIP signalling protocol. The thesis covers areas such as VoIP telephony, computer networks and machine learning (proprietary speech technologies from Phonexia). The benefits of deploying the voice assistant are reduced costs of outgoing calls for clients, relief for call centre agents answering frequently asked questions, and increased customer interest thanks to the use of new technologies.
Visualization of Outputs from Speech Technologies for Contact Centers
Zhezhela, Oleksandr ; Szőke, Igor (referee) ; Schwarz, Petr (advisor)
The thesis is aimed at the visualization of data mined by speech processing technologies. Several methods of speech data extraction were studied, and technologies for this task were analysed. The variety of metadata that can be mined from speech was defined. Existing standards and processes of call centres were also examined. Requirements for the user interface were gathered and analysed. On that basis, and after communication with call centre employees, a concept for speech data visualization was defined and implemented. The resulting solutions were integrated into Speech Analytics Server (SPAS).
