National Repository of Grey Literature: 83 records found (records 21 - 30)
Automatic Pronunciation Evaluation of Non-Native English Speakers
Gazdík, Peter ; Szőke, Igor (referee) ; Žmolíková, Kateřina (advisor)
Computer-Assisted Pronunciation Training (CAPT) is becoming increasingly popular. However, the accuracy of existing CAPT systems is still quite low. This diploma thesis therefore focuses on improving existing methods for automatic pronunciation evaluation at the segmental level. The first part describes common techniques for this task. We then propose a system based on two approaches. Finally, the experiments performed show a significant improvement over the reference system.
Pedestrian Identification
Jurča, Jan ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This thesis deals with pedestrian identification from a video sequence based on person, face, and gait recognition. Pretrained networks are used for person and face recognition, while many different networks are implemented and compared for gait recognition. The final pedestrian recognition is based on multimodal fusion realized by a neural network. A dataset was created for this work, along with a set of tools that allow its almost automatic creation.
Automatic Chord Recognition Using Deep Neural Networks
Nodžák, Petr ; Bidlo, Michal (referee) ; Vašíček, Zdeněk (advisor)
This work deals with automatic chord recognition using neural networks. The problem was separated into two subproblems. The first is to find experimentally the most suitable acoustic model, and the second is to find experimentally the most suitable language model. The problem was solved iteratively: first a suboptimal solution to the first subproblem was found, and then one to the second. A total of 19 acoustic and 12 language models were made. Ten training datasets were created for acoustic models and three for language models. In total, over 200 models were trained. The best results were achieved with acoustic models represented by convolutional networks together with language models represented by recurrent networks with LSTM modules.
Efficiency of deep convolutional neural networks on an elementary classification task
Prax, Jan ; Dobrovský, Ladislav (referee) ; Škrabánek, Pavel (advisor)
This thesis compares deep convolutional neural network models with feature descriptor models. The feature descriptors are paired with a suitably chosen classifier. Because these models belong to machine learning, the types of machine learning are described first. The chosen models are then described, and their principles and problems are explained. The hardware and software used for the tests are listed, followed by the test results and their summary. Finally, the models are compared on validation accuracy and training time.
Convolutional Networks for Document Layout Analysis
Endrych, David ; Herout, Adam (referee) ; Kodym, Oldřich (advisor)
The goal of this thesis is to create a tool for analyzing the page layouts of text documents. The problem is solved with convolutional neural networks. The architecture chosen in this thesis is U-Net. A cross-entropy error function with a weight map is used to train the network model. Paragraph regions are obtained through connected component analysis. Experiments are evaluated using the Symmetric Best Dice object metric. The experiments show that it is better to use all paragraph edges than to focus only on vertical paragraph edges. In addition, they show that batch sampling strategies and adaptive resolution help to improve the analysis results. The experiments also cover the use of separators, which is helpful when analyzing multi-column documents.
Deep Learning for Object Detection
Pitoňák, Radoslav ; Dobeš, Petr (referee) ; Teuer, Lukáš (advisor)
This thesis analyzes different object detection methods which are based on deep neural networks. In the beginning, the convolutional neural networks are described and commonly used object detection methods are compared. In the following parts, the proposal and implementation of the object detection model trained on the specific dataset are described. In conclusion, the achieved results of this model are discussed and compared with the results of other methods.
Photo Noise Reduction Using Deep Neural Networks
Tichý, Jonáš ; Juránek, Roman (referee) ; Španěl, Michal (advisor)
Image noise is a fundamental problem in digital photography. The aim of this thesis is to study noise reduction in photographs using deep neural networks. Two selected methods based on deep neural networks, DnCNN and BRDNet, were implemented and their performance was measured in several experiments. In addition, a user experiment was designed and conducted to evaluate image quality as perceived by the general public. The experiments showed that while both methods achieve excellent results in metrics such as PSNR and SSIM, perceived visual quality does not always correlate with the numerical metrics. The results presented in this thesis highlight the importance of suitable training data and image quality metrics in denoising digital photographs.
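For readers unfamiliar with PSNR, one of the metrics named in the abstract above, the following is a minimal sketch of its standard definition, 10·log10(MAX² / MSE). It is not taken from the thesis: the function name `psnr`, the flat-list image representation, and the default 8-bit peak value of 255 are assumptions made for illustration.

```python
import math


def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images.

    Images are given as flat sequences of pixel intensities (an assumption
    of this sketch); max_value is the largest possible pixel value.
    """
    if len(reference) != len(test) or len(reference) == 0:
        raise ValueError("images must be non-empty and of equal size")
    # Mean squared error over all pixels.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)
```

A higher PSNR means the test image is numerically closer to the reference, which, as the abstract notes, does not necessarily match perceived visual quality.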
Deep Neural Networks for Classifying Objects in an Image
Mlynarič, Tomáš ; Zemčík, Pavel (referee) ; Hradiš, Michal (advisor)
This paper deals with classifying objects using deep neural networks. Whole-scene segmentation is used as the main classification algorithm; it works with video sequences and exploits information shared between two video frames. Optical flow is used to extract this information from the video frames, and the feature maps of a neural network are warped according to it. Two neural network architectures were adapted to work with videos and evaluated experimentally. The results of the experiments show that using videos for image segmentation improves accuracy (IoU) compared to the same architecture working with still images.
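The IoU (intersection over union) accuracy measure mentioned in the abstract above can be sketched as follows. This is only an illustration of the common mean-IoU definition for segmentation, not the evaluation code of the paper: the function name `mean_iou`, the flat lists of per-pixel class labels, and the choice to skip classes absent from both inputs are assumptions.

```python
def mean_iou(predicted, target, num_classes):
    """Mean intersection over union across classes.

    predicted and target are flat sequences of integer class labels,
    one per pixel (an assumption of this sketch).
    """
    if len(predicted) != len(target):
        raise ValueError("label sequences must have equal length")
    scores = []
    for cls in range(num_classes):
        # Pixels labelled cls in both inputs, and in at least one of them.
        intersection = sum(1 for p, t in zip(predicted, target)
                           if p == cls and t == cls)
        union = sum(1 for p, t in zip(predicted, target)
                    if p == cls or t == cls)
        if union > 0:  # skip classes that appear in neither input
            scores.append(intersection / union)
    return sum(scores) / len(scores) if scores else 0.0
```

For example, with predictions `[0, 0, 1, 1]` and ground truth `[0, 1, 1, 1]`, class 0 scores 1/2 and class 1 scores 2/3, so the mean IoU is 7/12.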
Visual Car-Detection on the Parking Lots Using Deep Neural Networks
Stránský, Václav ; Veľas, Martin (referee) ; Rozman, Jaroslav (advisor)
The concept of smart cities is inherently connected with efficient parking solutions based on knowledge of the occupancy of individual parking spaces. The subject of this paper is the design and implementation of a robust system for analyzing parking space occupancy from a multi-camera system with possible visual overlap between cameras. The system is designed and implemented in the Robot Operating System (ROS), and its core consists of two separate classifiers. The more accurate but slower option is detection by a deep neural network. A fast response is provided by a less accurate motion classifier with a background model. The system is capable of working in real time on a graphics card as well as on a processor. The success rate of the system on a test dataset from real operation exceeds 95 %.
Document Quality Enhancement
Trčka, Jan ; Zemčík, Pavel (referee) ; Juránek, Roman (advisor)
The aim of this work is to increase the accuracy of the transcription of text documents. The work focuses mainly on texts printed on degraded materials such as newspapers or old books. To solve this problem, current methods and the problems associated with text recognition are analyzed. Based on the acquired knowledge, a method based on the GAN architecture is chosen and implemented. Experiments are performed on these networks in order to find their appropriate size and learning parameters. Subsequently, testing is performed to compare different learning methods and their results. Both training and testing are performed on an artificial dataset. Using the trained networks increases the transcription accuracy from 65.61 % for the raw damaged text lines to 93.23 % for lines processed by the network.
