National Repository of Grey Literature: 69 records found, showing records 16 - 25
Deep Learning for Facial Recognition in Video
Jeřábek, Vladimír ; Sochor, Jakub (referee) ; Hradiš, Michal (advisor)
This work deals with face recognition in video using neural networks. It first describes the selection and verification of a convolutional neural network that generates feature vectors from images of different identities. It then addresses the aggregation of feature vectors from video frames, which is performed by aggregation neural networks. Finally, the results obtained by the aggregation methods are discussed.
Deep Learning for Facial Recognition in Video
Stratil, Jan ; Sochor, Jakub (referee) ; Hradiš, Michal (advisor)
This bachelor's thesis deals with facial recognition in video using deep neural networks. The task is split into two parts. The first part deals with training a network that produces a compact feature vector representing the face identity in a video frame. The second part deals with training an aggregation network that combines those feature vectors into one. This aggregation is fast and gives better results than naive pooling methods. The approach is evaluated on the LFW dataset, where it achieves 92.8% accuracy, and on the YTF dataset, where it achieves 84.06% accuracy.
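For readers unfamiliar with the naive pooling baseline mentioned in the abstract above, the following is a minimal sketch of one common variant: averaging per-frame feature vectors and comparing the pooled vectors with cosine similarity. The function names and the toy 3-dimensional features are illustrative assumptions; the abstract does not state which pooling methods or which similarity measure the thesis used.

```python
import math

def average_pool(frame_vectors):
    """Naive aggregation: element-wise mean of the per-frame feature vectors."""
    n = len(frame_vectors)
    dim = len(frame_vectors[0])
    return [sum(v[i] for v in frame_vectors) / n for i in range(dim)]

def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Made-up features for two short "videos" (assumption, not thesis data).
video_a = [[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]]
video_b = [[0.9, 0.1, 0.0], [1.0, 0.0, 0.1]]

pooled_a = average_pool(video_a)
pooled_b = average_pool(video_b)
score = cosine_similarity(pooled_a, pooled_b)
print(f"similarity of pooled vectors: {score:.3f}")
```

A learned aggregation network would replace `average_pool` with a model that weights frames unequally, for example to down-weight blurred frames.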
Deep Learning for Image Classification
Ziková, Jana ; Veľas, Martin (referee) ; Hradiš, Michal (advisor)
This bachelor's thesis deals with the classification of e-commerce website products from product photographs. For this purpose, we use existing implementations of deep convolutional neural network models. The goal of this thesis is to design experiments that lead to the best possible results in product image classification.
Multi-Task Neural Networks for Speech Recognition
Egorova, Ekaterina ; Veselý, Karel (referee) ; Karafiát, Martin (advisor)
The first part of this master's thesis presents a theoretical analysis of the principles of neural networks, including their possible use in speech recognition. The thesis continues with a description of multi-task neural networks and related experiments. The practical part involved modifying software for training neural networks to enable multi-task training. The prepared environment, including several dedicated scripts, is also described. The experiments presented in this thesis verify the use of articulatory characteristics of speech for multi-task training. They were carried out on two speech databases that differ in quality and size and represent different languages, English and Vietnamese. Articulatory characteristics were also combined with other secondary tasks, such as context, in order to verify their complementarity. A comparison is made across neural networks of different sizes to describe the relationship between network size and the effectiveness of multi-task training. The experiments conclude that multi-task training with articulatory characteristics as secondary tasks leads to better training of neural networks, and that this training can result in more accurate phoneme recognition. Finally, the multi-task neural networks are tested as a feature extractor in a speech recognition system.
Pedestrian Identification
Jurča, Jan ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This thesis deals with pedestrian identification from video sequences based on person, face and gait recognition. Pretrained networks are used for person and face recognition, while many different networks are implemented and compared for gait recognition. The final pedestrian recognition is based on multimodal fusion realized by a neural network. A dataset was created for this work, along with a set of tools that allow its almost automatic creation.
Automatic Chord Recognition Using Deep Neural Networks
Nodžák, Petr ; Bidlo, Michal (referee) ; Vašíček, Zdeněk (advisor)
This work deals with automatic chord recognition using neural networks. The problem was separated into two subproblems. The first is to find experimentally the most suitable acoustic model, and the second is to find experimentally the most suitable language model. The problem was solved iteratively: first a suboptimal solution to the first subproblem was found, and then to the second one. A total of 19 acoustic and 12 language models were created. Ten training datasets were created for acoustic models and three for language models. In total, over 200 models were trained. The best results were achieved with acoustic models based on convolutional networks combined with language models based on recurrent networks with LSTM modules.
Efficiency of deep convolutional neural networks on an elementary classification task
Prax, Jan ; Dobrovský, Ladislav (referee) ; Škrabánek, Pavel (advisor)
In this thesis, deep convolutional neural network models and feature descriptor models are compared. Feature descriptors are paired with suitably chosen classifiers. Because these models belong to machine learning, the types of machine learning are described. The chosen models are then described, and their fundamentals and problems are explained. The hardware and software used for the tests are listed, followed by the test results and their summary. Finally, the models are compared on the basis of validation accuracy and training time.
Convolutional Networks for Document Layout Analysis
Endrych, David ; Herout, Adam (referee) ; Kodym, Oldřich (advisor)
The goal of this thesis is to create a tool for analyzing the page layouts of text documents. The problem is solved with convolutional neural networks, using the U-Net architecture. A cross-entropy error function with a weight map is used to train the network model. Paragraph regions are obtained through connected component analysis. Experiments are evaluated using the Symmetric Best Dice object metric. Experiments have shown that it is better to use all paragraph edges than to focus only on vertical paragraph edges. In addition, experiments show that batch sampling strategies and adaptive resolution help to improve analysis results. The experiments also describe the application of separators, which is useful in analyzing multi-column documents.
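As an illustration of the evaluation metric named in the abstract above, here is a minimal sketch of Symmetric Best Dice under its commonly used definition: for each object in one set, take the best Dice score against any object in the other set, average these, and report the minimum over both directions. Representing each object as a set of pixel indices is an assumption made for brevity, and the thesis implementation may differ in detail.

```python
def dice(a, b):
    """Dice coefficient of two pixel sets; two empty sets count as a perfect match."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))

def best_dice(objects_a, objects_b):
    """Mean, over objects in A, of the best Dice score against any object in B."""
    if not objects_a:
        return 0.0
    total = sum(max((dice(a, b) for b in objects_b), default=0.0) for a in objects_a)
    return total / len(objects_a)

def symmetric_best_dice(predicted, ground_truth):
    """Minimum of the two directional Best Dice scores."""
    return min(best_dice(predicted, ground_truth), best_dice(ground_truth, predicted))

# Toy example: one ground-truth paragraph split into two predicted regions.
ground_truth = [{1, 2, 3, 4}]
predicted = [{1, 2}, {3, 4}]
print(symmetric_best_dice(predicted, ground_truth))
```

Taking the minimum of both directions penalizes both over-segmentation (one paragraph split into several regions) and under-segmentation (several paragraphs merged into one).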
Photo Noise Reduction Using Deep Neural Networks
Tichý, Jonáš ; Juránek, Roman (referee) ; Španěl, Michal (advisor)
Image noise is a fundamental problem in digital photography. The aim of this thesis is to study noise reduction in photographs using deep neural networks. Two selected methods based on deep neural networks, DnCNN and BRDNet, were implemented and their performance was measured in several experiments. In addition, a user experiment was designed and carried out to evaluate the image quality perceived by the general public. The experiments showed that while both methods achieve excellent results in metrics such as PSNR and SSIM, perceived visual quality does not always correlate with numerical metrics. The results presented in this thesis highlight the importance of suitable training data and image quality metrics in denoising digital photographs.
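To make the PSNR metric mentioned in the abstract above concrete, the following is a minimal sketch of its standard definition, 10 * log10(MAX^2 / MSE). Treating an image as a flat list of pixel intensities with a maximum value of 255 is an assumption made for brevity and is not taken from the thesis.

```python
import math

def psnr(reference, test, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized pixel lists."""
    if len(reference) != len(test):
        raise ValueError("images must have the same number of pixels")
    # Mean squared error between the clean reference and the denoised output.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)

# Made-up pixel values: every pixel is off by 10 intensity levels.
clean = [100, 100, 100, 100]
denoised = [110, 90, 110, 90]
print(f"PSNR: {psnr(clean, denoised):.2f} dB")
```

Because PSNR depends only on the per-pixel squared error, two outputs with the same PSNR can look quite different to a viewer, which is consistent with the abstract's observation that perceived quality does not always follow the numerical metrics.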
Deep Neural Networks for Classifying Objects in an Image
Mlynarič, Tomáš ; Zemčík, Pavel (referee) ; Hradiš, Michal (advisor)
This paper deals with classifying objects using deep neural networks. Whole-scene segmentation, which works with video sequences and obtains information between two video frames, was used as the main classification algorithm. Optical flow was used to extract information from the video frames, and the feature maps of a neural network are warped based on it. Two neural network architectures were adapted to work with videos and evaluated experimentally. The results of the experiments show that using videos for image segmentation improves accuracy (IoU) compared to the same architecture working with single images.
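The IoU (intersection over union) accuracy measure cited in the abstract above can be sketched as follows for a single class. Representing a segmentation as a flat list of per-pixel class labels is an assumption made for brevity. The abstract does not say how per-class scores were averaged, so no averaging is shown.

```python
def class_iou(predicted, target, cls):
    """IoU for one class: pixels labelled cls in both maps / pixels labelled cls in either."""
    intersection = sum(1 for p, t in zip(predicted, target) if p == cls and t == cls)
    union = sum(1 for p, t in zip(predicted, target) if p == cls or t == cls)
    if union == 0:
        return None  # class absent from both maps, IoU undefined
    return intersection / union

# Made-up 4-pixel label maps with classes 0 (background) and 1 (object).
predicted = [0, 1, 1, 0]
target = [0, 1, 0, 0]
print(class_iou(predicted, target, 1))
```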
