National Repository of Grey Literature 19 records found  1 - 10next  jump to record: Search took 0.00 seconds. 
Assessment of Uncertainty of Neural Net Predictions in the Tasks of Classification, Detection and Segmentation
Vlasák, Jiří ; Kohút, Jan (referee) ; Herout, Adam (advisor)
This work focuses on comparing three widely used methods for improving uncertainty estimations: Deep Ensembles, Monte Carlo Dropout, and Temperature Scaling. These methods are applied to six computer vision models that are pretrained as well as trained from scratch. The models are then evaluated on computer vision datasets for classification, semantic segmentation, and object detection using a wide range of metrics. The models are also evaluated on distorted versions of these datasets to measure their performance on out-of-distribution data.      These modified models achieve promising results. Ensembles outperform the other models by as high as 70 % in accuracy and 0.2 in IOU on the distorted MedSeg COVID-19 segmentation dataset while also outperforming the other models on the CIFAR-100 and FMNIST datasets.
Recognition of Driving Lane Borders in Video from On-Board Camera
Fridrich, David ; Kohút, Jan (referee) ; Herout, Adam (advisor)
This paper talks about lane detection. Specifically custom generator of synthetic images, usage during training of neural networks, testing on convolutional neural network (CNN) UNet model and possibilities of extension of this model to SALMnet (Structure-Aware Lane Marking Detection Network) via addding SGCA module (semantic-guided channel attention) and PDC module (pyramid deformable convolution). Training results from synthetic datasets show very accurate results, reaching around 95\,\% in accuracy (even 99\,\% for easier images). Trainings with real datasets show lower accuracy, depending on the difficulty of the dataset itself. TuSimple has easier and clearer images and reaches about 62\,\%. CuLane is much more complex and results show accuracy around 37\,\%.
Text Recognition Enhanced by Writer Identity
Trněný, Matěj ; Kišš, Martin (referee) ; Kohút, Jan (advisor)
The objective of this theses was to implement a neural network for text recognition enhanced by writers identity. Adversarial learning method was selected for this purpose. Usefulness of this method was verified by experiments. This net should yield better results on data which are not similar to data contained in training data set. Accuracy of the resulting net was compared to method single-task learning and method multi-task learning. Net implementing single-task learning method has reached average character recognition error of 7, 995%, net implementing multi-task learning method has reached average error of 7, 565% and net implementing adversarial learning method has reached average error of 7, 573%. In comparison to the net implementing single-task learning multi-task learning has improvement of 5, 38% and adversarial learning has reached improvement of 5, 28%. 
Generating Animations with Neural Networks
Dráber, Filip ; Kohút, Jan (referee) ; Hradiš, Michal (advisor)
Ačkoli je snímání pohybu už tak nástrojem, který má animátorům pomoci zjednodušit ty nejsložitější aspekty tvorby realistických animací, spousta námahy je stále ukrytá v anotování a strukturalizaci těchto dat. Tento problém řeším návrhem neuronové sítě, která může být natrénována na datovém souboru nasnímaného pohybu tak, aby reprodukovala lidský pohyb, který je vizualizován v aplikaci, které umožňuje uživateli tento pohyb ovládat. Také experimentuji s různými metodami trénování autoregresivního modelu, a na základě toho určuji, která metoda nejlépe vyvažuje dobu trénování a výkon. Dalším postřehem je, jak přidání ovládacích hodnot do vlastností generovaných snímků ovlivňuje použití rekurentních neuronových sítí pro tento úkol.
Adaptation of Neural Networks to Target Writer
Sekula, Jakub ; Hradiš, Michal (referee) ; Kohút, Jan (advisor)
This bachelor's thesis deals with the adaptation of neural networks to a specific writer with an aim to improve recognition of handwritten text of this specific writer. The method that I use is fast, requires small training dataset and uses regularization, which tries to keep the distribution of regularized weights in adaptation network similar to the one in the original network. I tested this method on dataset of printed text called IMPACT and dataset of handwritten text. When testing on dataset of handwritten text I was able to improve recognition on two diaries with pre adaptation recognition error rate of 10,82 % and 1,82 % to 8,48 % and 0,77 % with a small number of adaptation iterations and using small amount of training lines. When testing on IMPACT dataset I was able to improve recognition error rate from 32,88 % to 5,30 %.
Learning to Generate Images with Convolutional Neural Networks
Kohút, Jan ; Kolář, Martin (referee) ; Hradiš, Michal (advisor)
The aim of this Bachelor's thesis is to design and analyze convolutional neural networks generating images of characters based on their parameters. Parameters of characters are type of char, font, colour of character, background colour, translation and rotation. Neural networks have created multidimensional representation of each parameter. Relations inside these representation are similar to relations inside parameters. Neural networks generate characters with new values of parameters based on interpolation between learned values of parameters. Neural networks are capable to generalize problem of generating images.
Transformer Neural Networks for Handwritten Text Recognition
Vešelíny, Peter ; Beneš, Karel (referee) ; Kohút, Jan (advisor)
This Master's thesis aims to design a system using the transformer neural network and perform experiments with this proposed model in the task of handwriting text recognition. In this thesis, a multilingual dataset with predominate Czech texts is used. The experiments examine the influence of basic hyperparameters, such as network size, convolutional encoder type, and the use of different text tokenizers. In this work, I also use text corpora of the Czech language which is used to train the network decoder. Furthermore, I experiment with the usage of additional textual information during the decoding process. This information comes from the previous line of the transcribed image. The transformer achieves a character recognition error rate of 3.41 % on the test data set which is 0.16 % worse performance than the recurrent neural network achieves. To compare this model with other transformer-based models from available articles, the network was trained on the IAM dataset, where it achieved an error of 2.48 % and therefore outperformed other models in handwriting text recognition task.
Active Learning for OCR
Kohút, Jan ; Kolář, Martin (referee) ; Hradiš, Michal (advisor)
The aim of this Master's thesis is to design methods of active learning and to experiment with datasets of historical documents. A large and diverse dataset IMPACT of more than one million lines is used for experiments. I am using neural networks to check the readability of lines and correctness of their annotations. Firstly, I compare architectures of convolutional and recurrent neural networks with bidirectional LSTM layer. Next, I study different ways of learning neural networks using methods of active learning. Mainly I use active learning to adapt neural networks to documents that the neural networks do not have in the original training dataset. Active learning is thus used for picking appropriate adaptation data. Convolutional neural networks achieve 98.6\% accuracy, recurrent neural networks achieve 99.5\% accuracy. Active learning decreases error by 26\% compared to random pick of adaptations data.
Deep Neural Networks for Historical Document Classification
Pinkeová, Bettina ; Kohút, Jan (referee) ; Kišš, Martin (advisor)
The aim of this work is to create a system for historical documents classification . The task is specifically about classification of documents according to the place of origin. Several systems are proposed for solving this problem, in the work. The first designed and implemented system is based on a convolutional neural network with a self-attention mechanism instead of an average pooling layer. Another system is based on the BEiT model, which is built on a visual transformer. The BEiT model was pretrained on the task of masked image modelling and subsequently trained on the given classification task. The system based on convolutional neural network achieved an accuracy of 81.6% and the system based on masked image modelling achieved an accuracy of 82.9%. The systems implemented in this work, surpassed the systems participating in the ICDAR 2021 conference in terms of success.
Fast Discriminative Neural Networks for Text Correction
Chupáč, Sebastián ; Beneš, Karel (referee) ; Kohút, Jan (advisor)
The goal of this work is to propose and implement a fast discriminating neural network with only one forward pass, to detect and correct mistakes in text data. Multiple architectures were implemented for detection and correction separately. These models make use of convolution layers, LSTM layers and CTC loss function. Models were trained and evaluated on datasets made from three different text corpora. Experiments and evaluation present the ability of these models to detect and correct mistakes on character level with only one, fast forward pass.

National Repository of Grey Literature : 19 records found   1 - 10next  jump to record:
See also: similar author names
1 Kohút, Jakub
4 Kohút, Jiří
4 Kohút, Josef
Interested in being notified about new results for this query?
Subscribe to the RSS feed.