National Repository of Grey Literature 89 records found  beginprevious21 - 30nextend  jump to record: Search took 0.01 seconds. 
Deep Learning for Image Stitching
Držíková, Diana Maxima ; Vaško, Marek (referee) ; Španěl, Michal (advisor)
Zošívanie obrázkov nie je taký neznámy pojem ako sa na prvý pohľad môže zdať. Určite každý bežný používateľ technológií sa už zozámil s pojmom panoramatický obrázok. V pozadí na zariadení sa prekrývajúce sa obrázky zošívajú a tým vzniká vysoko kvalitný obrázok. Na to aby tento proces fungoval, existujúce algorimy musia spoľahlivo a presne detekovať zaujímavé body, podľa ktorých sa dokáže obrázok správne umiesniť. V tejto práci budú predstavené tradičné metódy na zošívanie obrázkov a taktiež aj metódy s pomocou hlbokých neurónových sietí. Hlavné dva modely, ktoré budú opísane a použíté sú implementácie SuperPoint a SuperGlue. Implementácia bude adaptovaná na párovací systém pre viac ako dva obrázky. Ostatné experimenty, ktoré boli vyskúšané a dopomohli k pochopeniu tejto problematiky budú opísane a vyhodnotené.
Video Enhancement Using Convolutional Networks
Skácel, David ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
Convolutional neural networks (CNN) represent a state-of-the-art approach to non-trivial image processing tasks, including compression artifacts reduction and image super-resolution. As some research groups nowadays show, these networks can also be leveraged to perform such tasks on real-world video data, resulting in video spatial super-resolution and more. The main goal of this work is to determine whether these nets can be adjusted to perform temporal super-resolution of real-world video data. I utilize the aforementioned neural net architectures in this paper to do so. As I show, given that the input videos are of reasonable quality, these nets are capable of double-image interpolation up to a certain level, where the output image is usable for temporal upsampling. Although the presented results are promising, I encourage more research to be done on this topic.
Deep Learning for Facial Recognition in Video
Jeřábek, Vladimír ; Sochor, Jakub (referee) ; Hradiš, Michal (advisor)
This work deals with face recognition in video using neural networks. In the beginning, there is described the process of selection and verification of convolution neural network to generate feature vectors from images of different identities. In the next part, this work deals with the aggregation of feature vectors from video frames. Aggregation takes place through aggregation neural networks. At the end of this work, the results obtained by the aggregation methods are discussed.
Deep Learning for Facial Recognition in Video
Stratil, Jan ; Sochor, Jakub (referee) ; Hradiš, Michal (advisor)
This bachelor's thesis deals with facial recognition in video using deep neural networks. This task is split into 2 parts. The first part deals with training network that produces compact feature vector which represents the face identity from a video frame. The second part deals with training aggregation network that aggregates those feature vectors into one. This aggregation is fast and it has shown that its results are better than naive pooling methods. Results are tested on the LFW dataset, where it achieves 92.8% accuracy and on the YTF dataset, where the accuracy is 84.06%.
Deep Learning for Image Classification
Ziková, Jana ; Veľas, Martin (referee) ; Hradiš, Michal (advisor)
This bachelor thesis deals with electronic commerce website products classification using product's photographs. For this purpose we use already implemented models of deep convolutional neural networks. Tho goal of this theses is to design experiments that will lead to the best possible results in product images classification.
Multi-Task Neural Networks for Speech Recognition
Egorova, Ekaterina ; Veselý, Karel (referee) ; Karafiát, Martin (advisor)
První část této diplomové práci se zabývá teoretickým rozborem principů neuronových sítí, včetně možnosti jejich použití v oblasti rozpoznávání řeči. Práce pokračuje popisem viceúkolových neuronových sítí a souvisejících experimentů. Praktická část práce obsahovala změny software pro trénování neuronových sítí, které umožnily viceúkolové trénování. Je rovněž popsáno připravené prostředí, včetně několika dedikovaných skriptů. Experimenty představené v této diplomové práci ověřují použití artikulačních characteristik řeči pro viceúkolové trénování. Experimenty byly provedeny na dvou řečových databázích lišících se kvalitou a velikostí a representujících různé jazyky - angličtinu a vietnamštinu. Artikulační charakteristiky byly také kombinovány s jinými sekundárními úkoly, například kontextem, s záměrem ověřit jejich komplementaritu. Porovnaní je provedeno s neuronovými sítěmi různých velikostí tak, aby byl popsán vztah mezi velikostí neuronových sítí a efektivitou viceúkolového trénování. Závěrem provedených experimentů je, že viceúkolové trénování s použitím artikulačnich charakteristik jako sekundárních úkolů vede k lepšímu trénování neuronových sítí a výsledkem tohoto trénování může být přesnější rozpoznávání fonémů. V závěru práce jsou viceúkolové neuronové sítě testovány v systému rozpoznávání řeči jako extraktor příznaků.
Automatic Pronunciation Evaluation of Non-Native English Speakers
Gazdík, Peter ; Szőke, Igor (referee) ; Žmolíková, Kateřina (advisor)
Computer-Assisted Pronunciation Training (CAPT) is becoming more and more popular these days. However, the accuracy of existing CAPT systems is still quite low. Therefore, this diploma thesis focuses on improving existing methods for automatic pronunciation evaluation on the segmental level. The first part describes common techniques for this task. Afterwards, we proposed the system based on two approaches. Finally, performed experiments show significant improvement over the reference system.
Pedestrian Identification
Jurča, Jan ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This thesis deals with pedestrian identification from video sequence based on person, face and gait recognition. For person and face recognition are used pretrained networks. While for gait recognition is implemented and compared many different networks. Final pedestrian recognition is based on multimodal fusion realized by neural network. For the purpose of the work was created dataset, along with a set of tools that allow its almost automatic creation.
Automatic Chord Recognition Using Deep Neural Networks
Nodžák, Petr ; Bidlo, Michal (referee) ; Vašíček, Zdeněk (advisor)
This work deals with automatic chord recognition using neural networks. The problem was separated into two subproblems. The first subproblem aims to experimental finding of most suitable solution for a acoustic model and the second one aims to experimental finding of most suitable solution for a language model. The problem was solved by iterative method. First a suboptimal solution of the first subproblem was found and then the second one. A total of 19 acoustic and 12 language models were made. Ten training datasets was created for acoustic models and three for language models. In total, over 200 models were trained. The best results were achieved on acoustic models represented by convolutional networks together with language models represented by recurent networks with LSTM modules.
Efficiency of deep convolutional neural networks on an elementary classification task
Prax, Jan ; Dobrovský, Ladislav (referee) ; Škrabánek, Pavel (advisor)
In this thesis deep convolutional neural networks models and feature descriptor models are compared. Feature descriptors are paired with suitable chosen classifier. These models are a part of machine learning therefore machine learning types are described in this thesis. Further these chosen models are described, and their basics and problems are explained. Hardware and software used for tests is listed and then test results and results summary is listed. Then comparison based on the validation accuracy and training time of these said models is done.

National Repository of Grey Literature : 89 records found   beginprevious21 - 30nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.