National Repository of Grey Literature 26 records found  1 - 10nextend  jump to record: Search took 0.01 seconds. 
Retinal Images Generation with a Limited Amount of Training Data
Senichak, Yahor ; Semerád, Lukáš (referee) ; Kavetskyi, Andrii (advisor)
The purpose of this study is to explore the progress and application of computer vision and generative adversarial networks (GANs3.1) in the diagnosis and study of fundus diseases. Particular attention is paid to the latest advances in the field of medical data synthesis and the development of our own algorithm. Recent advances in the deep learning architecture U-GAT-IT [22], which includes two pairs of deep neural networks (two generators and two discriminators), have been implemented. This implementation was trained for approximately 300,000 iterations, during which positive results were obtained. The dynamics of the training process were recorded and tests were performed to demonstrate the ability to generate high-quality synthetic images of the ocular background independent of the input data
Tracking people based on their clothing in multi-camera systems
Sivak, Mykyta ; Přinosil, Jiří (referee) ; Číka, Petr (advisor)
This bachelor thesis focuses on the development and implementation of an algorithm for tracking individuals in multi-camera systems based on clothing pattern analysis. The aim was to design a system capable of tracking an individual in various positions and frames, using the Region of Interest (RoI) technique. The study begins with a comprehensive review of the existing literature on object tracking in video sequences, with a special focus on RoI tracking techniques. During the research, a new algorithm was developed and implemented that utilizes clothing patterns as the primary identification element for tracking and re-identifying individuals across different camera shots. The algorithm was experimentally validated on datasets containing video sequences from various environments, allowing for a detailed analysis of its effectiveness and reliability. The experimental results demonstrate that the proposed system achieves significant accuracy and efficiency compared to traditional methods and is particularly effective in challenging situations where other methods fail. The thesis concludes with an evaluation of the conducted experiments along with recommendations for future extensions and improvements of the system. Potential challenges and ethical aspects, including issues of privacy and personal data processing, are also discussed.
Algorithm for Facial Image Quality Estimation
Husár, Tomáš ; Sakin, Martin (referee) ; Goldmann, Tomáš (advisor)
The precision of face recognition algorithms is heavily influenced by the quality of input images. The aim of the work is to evaluate the quality of face images using a convolutional neural network. The data on which the testing was carried out were created by various degradations of photos from the CelebA dataset. The resulting application determines the quality of images based on the predicted probabilities of individual degradations.
Self-supervised learning in computer vision applications
Vančo, Timotej ; Richter, Miloslav (referee) ; Janáková, Ilona (advisor)
The aim of the diploma thesis is to make research of the self-supervised learning in computer vision applications, then to choose a suitable test task with an extensive data set, apply self-supervised methods and evaluate. The theoretical part of the work is focused on the description of methods in computer vision, a detailed description of neural and convolution networks and an extensive explanation and division of self-supervised methods. Conclusion of the theoretical part is devoted to practical applications of the Self-supervised methods in practice. The practical part of the diploma thesis deals with the description of the creation of code for working with datasets and the application of the SSL methods Rotation, SimCLR, MoCo and BYOL in the role of classification and semantic segmentation. Each application of the method is explained in detail and evaluated for various parameters on the large STL10 dataset. Subsequently, the success of the methods is evaluated for different datasets and the limiting conditions in the classification task are named. The practical part concludes with the application of SSL methods for pre-training the encoder in the application of semantic segmentation with the Cityscapes dataset.
Vehicle License Plate Detection and Recognition Software
Masaryk, Adam ; Hradiš, Michal (referee) ; Špaňhel, Jakub (advisor)
The aim of this bachelor thesis is to design and develop software that can detect and recognize license plates from images. The software is divided into 3 parts - license plates detection, detector output processing and license plates characters recognition. We decided to implement detection and recognition using modern methods using convolutional neural networks.
Pedestrian Attribute Analysis
Studená, Zuzana ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This work deals with obtaining pedestrian information, which are captured by static, external cameras located in public, outdoor or indoor spaces. The aim is to obtain as much information as possible. Information such as gender, age and type of clothing, accessories, fashion style, or overall personality are obtained using using convolutional neural networks. One part of the work consists of creating a new dataset that captures pedestrians and includes information about the person's sex, age, and fashion style. Another part of the thesis is the design and implementation of convolutional neural networks, which classify the mentioned pedestrian characteristics. Neural networks evaluate pedestrian input images in PETA, FashionStyle14 and BUT Pedestrian Attributes datasets. Experiments performed over the PETA and FashionStyle datasets compare my results to various convolutional neural networks described in publications. Further experiments are shown on created BUT data set of pedestrian attributes.
Learning the Face Behind a Voice
Zubalík, Petr ; Mošner, Ladislav (referee) ; Plchot, Oldřich (advisor)
The main goal of this thesis is to design and implement a system that will be able to generate a face based on the speech of a given person. This problem is solved using a system composed of three convolutional neural network models. The first one is based on the ResNet architecture and is used to extract features from speech recordings. The second model is a fully convolutional neural network which converts the extracted features into the styles which form a base for the final facial image. These styles are then passed as an input to the StyleGAN generator, which creates the resulting face. The proposed system is implemented in the Python programming language using the PyTorch framework. The last chapter of the thesis discusses some of the most significant experiments performed to fine-tune and test the developed system.
Classification of thorax diseases on chest X-ray images using artificial intelligence
Pijáček, Štěpán ; Mikulec, Marek (referee) ; Mezina, Anzhelika (advisor)
Tato práce se zabývá výzkumem použitelných řešení pro problém klasifikace onemocnění hrudníku na rentgenových snímcích hrudníku pomocí umělé inteligence. Pro lepší pochopení problému jsou v prvních kapitolách vysvětleny základní konvoluční neuronové sítě a jejich výhody a nevýhody. Na základě těchto prvních vysvětlení jsou vybrány dvě neuronové sítě, které rozšiřují koncept konvoluční neuronové sítě. Těmito sítěmi jsou kapslová síť a reziduální síť, obě jsou dále vysvětleny v příslušných kapitolách s jejich výhodami a nevýhodami. Reziduální síť a kapslová síť jsou poté implementovány pomocí programovacího jazyka python a frameworku TensorFlow s knihovnou Keras, obě se svými příslušnými kapitolami. Na konci práce jsou uvedeny výsledky a závěr.
Deep neural network for supercomputer environments
Bronda, Samuel ; Kolařík, Martin (referee) ; Burget, Radim (advisor)
The main benefit of the work is the optimization of the hardware configuration for the calculation of neural networks. The theoretical part describes neural networks, deep learning frameworks and hardware options. The next part of the thesis deals with implementation of performance tests, which include application of Inception V3 and ResNet models. Network models are applied to various graphics cards and computing hardware. The output of the thesis is the implemented model of the network Inception V3, which examines the graphics cards and their performance, time-consuming calculations and their efficiency. The ResNet model is applied to a section that examines other impacts on neural network computing such as used disk, operating memory, and so on. Each practical part contains a discussion where the knowledge of the given part is explained. In the case of consumption measurement, a mismatch between the declaration by the manufacturer and the measured values was identified.
Graffiti Tags Re-Identification
Pavlica, Jan ; Beran, Vítězslav (referee) ; Špaňhel, Jakub (advisor)
This thesis focuses on the possibility of using current methods in the field of computer vision to re-identify graffiti tags. The work examines the possibility of using convolutional neural networks to re-identify graffiti tags, which are the most common type of graffiti. The work experimented with various models of convolutional neural networks, the most suitable of which was MobileNet using the triplet loss function, which managed to achieve a mAP of 36.02%.

National Repository of Grey Literature : 26 records found   1 - 10nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.