National Repository of Grey Literature 71 records found  beginprevious60 - 69next  jump to record: Search took 0.01 seconds. 
Deep learning based QRS delineator
Malina, Ondřej ; Hejč, Jakub (referee) ; Smíšek, Radovan (advisor)
This thesis deals with the issue of automatic measurement of the duration of QRS complexes in ECG signals. Special emphasis is then placed on the possibility of automatic detection of QRS complexes while exciting cardiac tissue with a pacemaker. The content of this work is divided into four logical units, where the first part deals with the heart as an organ. It describes the origin and spread of excitement in the heart, its possible pathologies and their manifestations in ECG recording, it also deals with pacing and measuring ECG recording during simultaneous pacing. The second part of the thesis contains a brief introduction to the topic of machine and deep learning. The third part of the thesis contains a search of current approaches using methods based on deep learning to solve the detection of QRSd. The fourth part deals with the design and implementation of its own model of deep learning, able to detect the beginnings and ends of QRS complexes from ECG recordings. It describes the data preprocessing implemented in the MATLAB programming environment. The actual implementation of the model was performed in the Python using the PyTorch and NumPy moduls.
Self-supervised learning in computer vision applications
Vančo, Timotej ; Richter, Miloslav (referee) ; Janáková, Ilona (advisor)
The aim of the diploma thesis is to make research of the self-supervised learning in computer vision applications, then to choose a suitable test task with an extensive data set, apply self-supervised methods and evaluate. The theoretical part of the work is focused on the description of methods in computer vision, a detailed description of neural and convolution networks and an extensive explanation and division of self-supervised methods. Conclusion of the theoretical part is devoted to practical applications of the Self-supervised methods in practice. The practical part of the diploma thesis deals with the description of the creation of code for working with datasets and the application of the SSL methods Rotation, SimCLR, MoCo and BYOL in the role of classification and semantic segmentation. Each application of the method is explained in detail and evaluated for various parameters on the large STL10 dataset. Subsequently, the success of the methods is evaluated for different datasets and the limiting conditions in the classification task are named. The practical part concludes with the application of SSL methods for pre-training the encoder in the application of semantic segmentation with the Cityscapes dataset.
Object detection in video using neural networks
Mikulský, Petr ; Sikora, Pavel (referee) ; Myška, Vojtěch (advisor)
This diploma thesis deals with the detection of moving objects in a video recording using neural networks. The aim of the thesis was to detect road users in video recordings. Pre-trained YOLOv5 object detection model was used for a practical part of the thesis. As part of the solution, an own dataset of traffic road video recordings was created and annotated with following classes: a car, a bus, a van, a motorcycle, a truck and a trailer truck. Final version of this dataset comprise 5404 frames and 6467 annotated objects in total. After training, the YOLOv5 model achieved 0.995 mAP, 0.995 precision and 0.986 recall on the dataset. All steps leading to the final form of the dataset are described in the conclusion chapter.
Object Detection in the Laser Scans Using Convolutional Neural Networks
Zelenák, Michal ; Kodym, Oldřich (referee) ; Veľas, Martin (advisor)
This work is focused on road segmentation in laser scans, using a convolutional neural network. To achieve this goal, which will find application in the field of road maintenance, convolutional neural networks have been used for their flexibility and speed. The work brings implementation and modifications of the existing method, which solves the problem by using a fully connected convolutional neural network. Used modifications include, for example using of various parameters for the loss function, the use of a different number of classes in the network model and dataset. The effect of the modification was experimentally verified and the accuracy of 96.12%, and the value for F-measure 95.02% were achieved.
Low-Dimensional Matrix Factorization in End-To-End Speech Recognition Systems
Gajdár, Matúš ; Grézl, František (referee) ; Karafiát, Martin (advisor)
The project covers automatic speech recognition with neural network training using low-dimensional matrix factorization. We are describing time delay neural networks with factorization (TDNN-F) and without it (TDNN) in Pytorch language. We are comparing the implementation between Pytorch and Kaldi toolkit, where we achieve similar results during experiments with various network architectures. The last chapter describes the impact of a low-dimensional matrix factorization on End-to-End speech recognition systems and also a modification of the system with TDNN(-F) networks. Using specific network settings, we were able to achieve better results with systems using factorization. Additionally, we reduced the complexity of training by decreasing network parameters with the use of TDNN(-F) networks.
Cell detection using convolutional neural networks
Doskočil, Ondřej ; Chmelík, Jiří (referee) ; Vičar, Tomáš (advisor)
This bachelor thesis deals with the use of convolutional neural networks for cell detection in image data. The theoretical part contains a description of the functioning of these networks and their various architectures. In the practical part, these networks were implemented and trained on an available dataset. However, each of these networks uses a different approach to detection. Finally, the individual networks were statistically evaluated and a discussion was conducted.
Detection of persons and evaluation of gender and age in image data
Dobiš, Lukáš ; Vičar, Tomáš (referee) ; Kolář, Radim (advisor)
Táto diplomová práca sa venuje automatickému rozpoznávaniu ludí v obrazových dátach s využitím konvolučných neurónových sieti na určenie polohy tváre a následnej analýze získaných dát. Výsledkom analýzy tváre je určenie pohlavia, emócie a veku osoby. Práca obsahuje popis použitých architektúr konvolučných sietí pre každú podúlohu. Sieť na odhad veku má natrénované nové váhy, ktoré sú vzápätí zmrazené a majú do svojej architektúry vložené LSTM vrstvy. Tieto vrstvy sú samostatne dotrénované a testované na novom datasete vytvorenom pre tento účel. Výsledky testov ukazujú zlepšenie predikcie veku. Riešenie pre rýchlu, robustnú a modulárnu detekciu tváre a ďalších ludských rysov z jedného obrazu alebo videa je prezentované ako kombinácia prepojených konvolučných sietí. Tieto sú implementované v podobe skriptu a následne vysvetlené. Ich rýchlosť je dostatočná pre ďalšie dodatočné analýzy tváre na živých obrazových dátach.
Obtaining and Processing of a Set of Vehicle License Plates
Kvapilová, Aneta ; Bartl, Vojtěch (referee) ; Herout, Adam (advisor)
This master thesis focuses on creating and processing a dataset, which contains semi-automatically processed images of vehicles licence plates. The main goal is to create videos and a set of tools, which are able to transform  input videos into a dataset used for traffic monitoring neural networks. Used programming language is Python, graphical library OpenCV and framework PyTorch for implementation of neural network.
Convolutional Networks for Lip Reading
Kadleček, Josef ; Kišš, Martin (referee) ; Hradiš, Michal (advisor)
This thesis deals with current methods for automatic speech recognition and lip reading via neural networks. Furthermore it deals with similarities in the architectures of neural networks for audio and visual data and available datasets in the field of audiovisual automatic speech recognition. The main contribution of this thesis is set of experiments comparing different changes in neural network architecture and its impact on results. The thesis includes an implementation of a system for automatic speech recognition from audio (CER: 12.6 %) and visual (CER: 57,7 %) data. The architectures of both systems are based on features extraction via convolutional networks followed by recurrent layers LSTM, another layer of convolutions and loss function CTC. 
Computer vision and hand gestures detection and fingers tracking
Bravenec, Tomáš ; Wyrzykowski, Roman (referee) ; Frýza, Tomáš (advisor)
Diplomová práce je zaměřena na detekci a rozpoznání gest rukou a prstů ve statických obrazech i video sekvencích. Práce obsahuje shrnutí několika různých přístupů k samotné detekci a také jejich výhody i nevýhody. V práci je též obsažena realizace multiplatformní aplikace napsané v Pythonu s použitím knihoven OpenCV a PyTorch, která dokáže zobrazit vybraný obraz nebo přehrát video se zvýrazněním rozpoznaných gest.

National Repository of Grey Literature : 71 records found   beginprevious60 - 69next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.