National Repository of Grey Literature 23 records found  1 - 10nextend  jump to record: Search took 0.00 seconds. 
Self-supervised learning in computer vision applications
Vančo, Timotej ; Richter, Miloslav (referee) ; Janáková, Ilona (advisor)
The aim of the diploma thesis is to make research of the self-supervised learning in computer vision applications, then to choose a suitable test task with an extensive data set, apply self-supervised methods and evaluate. The theoretical part of the work is focused on the description of methods in computer vision, a detailed description of neural and convolution networks and an extensive explanation and division of self-supervised methods. Conclusion of the theoretical part is devoted to practical applications of the Self-supervised methods in practice. The practical part of the diploma thesis deals with the description of the creation of code for working with datasets and the application of the SSL methods Rotation, SimCLR, MoCo and BYOL in the role of classification and semantic segmentation. Each application of the method is explained in detail and evaluated for various parameters on the large STL10 dataset. Subsequently, the success of the methods is evaluated for different datasets and the limiting conditions in the classification task are named. The practical part concludes with the application of SSL methods for pre-training the encoder in the application of semantic segmentation with the Cityscapes dataset.
Vehicle License Plate Detection and Recognition Software
Masaryk, Adam ; Hradiš, Michal (referee) ; Špaňhel, Jakub (advisor)
The aim of this bachelor thesis is to design and develop software that can detect and recognize license plates from images. The software is divided into 3 parts - license plates detection, detector output processing and license plates characters recognition. We decided to implement detection and recognition using modern methods using convolutional neural networks.
Pedestrian Attribute Analysis
Studená, Zuzana ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This work deals with obtaining pedestrian information, which are captured by static, external cameras located in public, outdoor or indoor spaces. The aim is to obtain as much information as possible. Information such as gender, age and type of clothing, accessories, fashion style, or overall personality are obtained using using convolutional neural networks. One part of the work consists of creating a new dataset that captures pedestrians and includes information about the person's sex, age, and fashion style. Another part of the thesis is the design and implementation of convolutional neural networks, which classify the mentioned pedestrian characteristics. Neural networks evaluate pedestrian input images in PETA, FashionStyle14 and BUT Pedestrian Attributes datasets. Experiments performed over the PETA and FashionStyle datasets compare my results to various convolutional neural networks described in publications. Further experiments are shown on created BUT data set of pedestrian attributes.
Learning the Face Behind a Voice
Zubalík, Petr ; Mošner, Ladislav (referee) ; Plchot, Oldřich (advisor)
The main goal of this thesis is to design and implement a system that will be able to generate a face based on the speech of a given person. This problem is solved using a system composed of three convolutional neural network models. The first one is based on the ResNet architecture and is used to extract features from speech recordings. The second model is a fully convolutional neural network which converts the extracted features into the styles which form a base for the final facial image. These styles are then passed as an input to the StyleGAN generator, which creates the resulting face. The proposed system is implemented in the Python programming language using the PyTorch framework. The last chapter of the thesis discusses some of the most significant experiments performed to fine-tune and test the developed system.
Classification of thorax diseases on chest X-ray images using artificial intelligence
Pijáček, Štěpán ; Mikulec, Marek (referee) ; Mezina, Anzhelika (advisor)
Tato práce se zabývá výzkumem použitelných řešení pro problém klasifikace onemocnění hrudníku na rentgenových snímcích hrudníku pomocí umělé inteligence. Pro lepší pochopení problému jsou v prvních kapitolách vysvětleny základní konvoluční neuronové sítě a jejich výhody a nevýhody. Na základě těchto prvních vysvětlení jsou vybrány dvě neuronové sítě, které rozšiřují koncept konvoluční neuronové sítě. Těmito sítěmi jsou kapslová síť a reziduální síť, obě jsou dále vysvětleny v příslušných kapitolách s jejich výhodami a nevýhodami. Reziduální síť a kapslová síť jsou poté implementovány pomocí programovacího jazyka python a frameworku TensorFlow s knihovnou Keras, obě se svými příslušnými kapitolami. Na konci práce jsou uvedeny výsledky a závěr.
Deep neural network for supercomputer environments
Bronda, Samuel ; Kolařík, Martin (referee) ; Burget, Radim (advisor)
The main benefit of the work is the optimization of the hardware configuration for the calculation of neural networks. The theoretical part describes neural networks, deep learning frameworks and hardware options. The next part of the thesis deals with implementation of performance tests, which include application of Inception V3 and ResNet models. Network models are applied to various graphics cards and computing hardware. The output of the thesis is the implemented model of the network Inception V3, which examines the graphics cards and their performance, time-consuming calculations and their efficiency. The ResNet model is applied to a section that examines other impacts on neural network computing such as used disk, operating memory, and so on. Each practical part contains a discussion where the knowledge of the given part is explained. In the case of consumption measurement, a mismatch between the declaration by the manufacturer and the measured values was identified.
Graffiti Tags Re-Identification
Pavlica, Jan ; Beran, Vítězslav (referee) ; Špaňhel, Jakub (advisor)
This thesis focuses on the possibility of using current methods in the field of computer vision to re-identify graffiti tags. The work examines the possibility of using convolutional neural networks to re-identify graffiti tags, which are the most common type of graffiti. The work experimented with various models of convolutional neural networks, the most suitable of which was MobileNet using the triplet loss function, which managed to achieve a mAP of 36.02%.
Efficiency of deep convolutional neural networks on an elementary classification task
Prax, Jan ; Dobrovský, Ladislav (referee) ; Škrabánek, Pavel (advisor)
In this thesis deep convolutional neural networks models and feature descriptor models are compared. Feature descriptors are paired with suitable chosen classifier. These models are a part of machine learning therefore machine learning types are described in this thesis. Further these chosen models are described, and their basics and problems are explained. Hardware and software used for tests is listed and then test results and results summary is listed. Then comparison based on the validation accuracy and training time of these said models is done.
Reconstruction of Facial Images Using Neural Networks
Zubalík, Petr ; Drahanský, Martin (referee) ; Goldmann, Tomáš (advisor)
The main purpose of this bachelor's thesis is to propose and implement a model, using neural networks, that will be able to reconstruct low-resolution facial images with blurry parts of the face. The task of super-resolution of facial images is solved by two models based on convolutional neural networks. The first model is built upon the architecture of ResNet whereas the other model makes use of the principles of generative adversarial networks. The proposed models are implemented in the Python programming language with the use of application programming interface of the TensorFlow framework. Moreover, as a part of this work, an application with a simple grafical user interface was created. This application makes it easy to use the implemented models. Several experiments are analyzed in the last chapter of this thesis to evaluate the performance of the models.
Identification of supraventricular tachycardia segments using multiple-instance learning
Abbrent, Jakub ; Novotná, Petra (referee) ; Ronzhina, Marina (advisor)
Supraventricular tachycardias have a high incidence in the population and often cause health disorders. The aim of this thesis is to automatically detect and localize atrial fibrillation in ECG records. The algorithm, implemented in Python, uses a convolutional neural network ResNet for detection with multiple-instance learning and decision rules. The output of the detection in the form of a feature signal is used for localization. The classification achieved F1 score of 0.87 on the test dataset. Then, paroxysmal atrial fibrillation was localized with a deviation of -0.40±2.26 seconds for the onsets and 1.09±2.75 seconds for the terminations of the episodes. Lastly, the obtained results are evaluated and discussed.

National Repository of Grey Literature : 23 records found   1 - 10nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.