National Repository of Grey Literature: 40 records found (showing 1 - 10)
Controlling Computer Using Gestures
Lacko, Peter ; Herout, Adam (referee) ; Juránek, Roman (advisor)
This work deals with the creation of a system for controlling a computer with gestures captured by a webcam. A gesture in this work is a hand motion that forms a pattern. The first part describes methods for hand detection, hand tracking and pattern recognition. A description of the system and its implementation follows, together with an evaluation of the tests. The outcome of this work is a program for simple control of a document viewer and a multimedia player.
Integration of Voice Technologies on Mobile Platforms
Černičko, Sergij ; Černocký, Jan (referee) ; Schwarz, Petr (advisor)
The goal of the thesis is to become familiar with the methods and techniques used in speech processing, to describe the current state of research and development in speech technology, to design and implement a server speech recognizer that uses BSAPI, and to integrate a client that uses this server for speech recognition into the mobile dictionaries of the Lingea company.
Visualization of User Pronunciations for Electronic Dictionaries
Pešán, Jan ; Chalupníček, Kamil (referee) ; Černocký, Jan (advisor)
The aim of this bachelor's thesis is to find a new direction for developing the learning capabilities of electronic dictionaries. The first part introduces the main concept of learning pronunciation through visualization of phonemes. The following chapter gives an overview of the speech processing methods used in this project, e.g. HMMs and the Viterbi algorithm. The third chapter describes the tools used to implement the whole system. The next chapter explains in more detail the technology of neural networks, used here as a probability estimator. It also describes a compatibility problem between the phoneme sets used, as well as the phoneme models used. Chapter 5 is devoted to the implementation of the system, including the scripts and tools applied to prepare the source data. The next chapter presents user testing with screenshots. The last chapter gives a short conclusion and possible directions for further development of this system.
Speech Recognition For Selected Languages
Schmitt, Jan ; Karafiát, Martin (referee) ; Janda, Miloš (advisor)
This bachelor's thesis deals with recognition of continuous speech for three languages: Bulgarian, Croatian and Swedish. It describes the basics of speech processing and recognition methods such as acoustic modeling using hidden Markov models and Gaussian mixture models. Another aim of this work is preparing data for those languages from the GlobalPhone database so that they can be used with the speech recognition toolkits Kaldi and HTK. With the data prepared, several models are trained and tested using the Kaldi toolkit.
Speech Recognition (digit)
Kantar, Martin ; Minář, Petr (referee) ; Matoušek, Radomil (advisor)
The aim of this diploma thesis is to explain what speech is and what its constituents are. I describe commonly used methods for preparing the signals used for recognition. Schematic examples show the principles of current speech recognizers, along with their advantages and disadvantages. I created a speech recognition program for the numerals 0-9 in Matlab, based on neural network learning.
Speech recognition using Sphinx-4
Kryške, Lukáš ; Uher, Václav (referee) ; Burget, Radim (advisor)
This diploma thesis aims to find an effective method for continuous speech recognition. More precisely, it uses speech-to-text recognition for keyword spotting. The solution is applicable to the analysis of phone calls and to similar applications. Most of the thesis describes and applies the speech recognition framework Sphinx-4, which uses hidden Markov models (HMM) to define the acoustic models of a language. It explains how these models can be trained for a new language or a new dialect. Finally, it describes in detail how to implement keyword spotting in the Java language.
Prediction of p53 Protein Binding Sites
Radakovič, Jozef ; Vogel, Ivan (referee) ; Martínek, Tomáš (advisor)
Protein p53, which is encoded by the gene TP53, plays a crucial role in the cell cycle as a regulator of gene transcription when the cell is under stress. p53 therefore acts as a tumor suppressor. Understanding the pathway of p53 regulation, as well as predicting its binding sites on p53-regulated genes, is one of the major concerns of modern research in genetics and bioinformatics. The first part of this project introduces the basics of molecular biology needed to understand the p53 protein pathway in gene transcription, and gives an introduction to the analysis and prediction of p53 binding sites. The second part covers the implementation and testing of a tool able to predict transcription factor binding sites for protein p53.
Acoustic signal classification
Pospíšil, Aleš ; Balík, Miroslav (referee) ; Atassi, Hicham (advisor)
This bachelor's thesis focuses on automatic music genre classification. The first part of the work evaluates the current state of the field and refers to published studies. The knowledge gained from them is applied in this work. In searching for a solution to the problem, the work summarizes and describes suitable music features and classification techniques such as neural networks and k-nearest neighbor. The four selected classification classes were classical, electro, jazz and rock music. The result of the work is a user-friendly system that provides automatic music genre recognition. The classification performance achieved is more or less comparable to human music genre recognition.
Decoder for key word detection system
Krotký, Jan ; Míča, Ivan (referee) ; Pfeifer, Václav (advisor)
The essay presents the basic characteristics of human speech recognition, describes systems for keyword detection, and then deals with the design of the individual decoder blocks, divided into three chapters. The first chapter describes the operations performed before the signal is divided into frames, and the segmentation. The second chapter describes the calculation of short-term energy, the number of zero crossings, and autocorrelation, linear prediction and Mel-frequency cepstral coefficients. The third chapter, on the design of the decoder block, describes the dynamic time warping method and a method based on hidden Markov models. The final part of the essay describes decoders working with speech and proposes a simple decoder for isolated words, which was designed and tested on the basis of the preceding chapters.
Modern methods of multimedia teaching
Mazal, Zdeněk ; Přinosil, Jiří (referee) ; Pfeifer, Václav (advisor)
The work summarizes the advantages and disadvantages of e-learning. The next section deals with keyword search in sound recordings: it surveys the methods used, the search engines in operation, their classification and their possible uses. It also includes the design, implementation and success-rate results of a simple search engine for words in sound recordings, programmed in the Matlab environment.
