National Repository of Grey Literature 132 records found  beginprevious101 - 110nextend  jump to record: Search took 0.01 seconds. 
Reconstruction of Missing Parts of the Face Using Neural Network
Marek, Jan ; Drahanský, Martin (referee) ; Goldmann, Tomáš (advisor)
Cílem této práce je vytvořit neuronovou síť která bude schopna rekonstruovat obličeje z fotografií na kterých je část obličeje překrytá maskou. Jsou prezentovány koncepty využívané při vývoji konvolučních neuronových sítí a generativních kompetitivních sítí. Dále jsou popsány koncepty používané v neuronových sítích specificky pro rekonstrukci fotografií obličejů. Je představen model generativní kompetitivní sítě využívající kombinaci hrazených konvolučních vrstev a víceškálových bloků schopný realisticky doplnit oblasti obličeje zakryté maskou.
Time-Domain Neural Network Based Speaker Separation
Peška, Jiří ; Černocký, Jan (referee) ; Žmolíková, Kateřina (advisor)
A thesis is about the usage of convolutional neural networks for automatic speech separation in an acoustic environment. The goal is to implement the neural network by following a TasNet architecture in the PyTorch framework, train this network with various values of hyper-parameters, and to compare the quality of separations based on the size of the network. In contrast to older architectures that transformed an input mixture into a time-frequency representation, this architecture uses a convolutional autoencoder, which transforms input mixture into a non-negative representation optimized for a speaker extraction. Separation is achieved by applying the masks, which are estimated in the separation module. This module consists of stacked convolutional blocks with increasing dilation, which helps with modeling of the long-term time dependencies in processed speech. Evaluation of the precision of the network is measured by a signal to distortion (SDR) metric, by a perceptual evaluation of speech quality (PESQ), and the short-time objective intelligibility (STOI). The Wall Street Journal dataset (WSJ0) has been used for training and evaluation. Trained models with various values of hyper-parameters enable us to observe the dependency between the size of the network and SDR value. While smaller network after 60 epochs of training reached 10.8 dB of accuracy, a bigger network reached 12.71 dB.
Impact of color models on performance of convolutional neural networks
Šimunský, Martin ; Doležel, Petr (referee) ; Škrabánek, Pavel (advisor)
Current knowledge about impact of colour models on performance of convolutional neural network is investigated in the first part of this thesis. The experiment based on obtained knowledge is conducted in the second part. Six colour models HSV, CIE 1931 XYZ, CIE 1976 L*a*b*, YIQ a YCbCr and deep convolutional neural network ResNet-101 are used. RGB colour model achieved the highest classification accuracy, whereas HSV color model has the lowest accuracy in this experiment.
Radar Signal Processing and Fusion of Information
Reich, Bořek ; Maršík, Lukáš (referee) ; Zemčík, Pavel (advisor)
This bachelor's thesis focuses on fusion of millimetr-wave radar and camera. It proposes appropriate procedure and usage of these sensors for object detection. Object detection in this bachelor's thesis is focused on people and provides additional information about detected person. It proposes convolution neural network as means of person detection and fusion of mmWave radar and camera data. When person is detected, distance of person from sensors is found in mmWave radar point cloud. Testing is performed on input data from both sensors in different situations, in poorly lit, unkwonwn scenes, with unknown people etc. Distance measuring is validated with reference data.
Detection, Tracking and Classification of Vehicles
Vopálenský, Radek ; Herout, Adam (referee) ; Juránek, Roman (advisor)
The aim of this master thesis is to design and implementation in language C++ a system for the detection, tracking and classification of vehicles from streams or records from traffic cameras. The system runs on the platform Robot Operating System and uses the OpenCV, FFmpeg, TensorFlow and Keras libraries. For detection is used cascade classifier, for tracking Kalman filter and for classification of the convolutional neural network. Success rate for detection is 91.93 %, tracking 81.94 % and classification 63.72 %. This system is part of a comprehensive system, that can moreover calibrate video and measure of vehicles speed. The resulting system can be used for traffic analysis.
Navigation Using Deep Convolutional Networks
Skácel, Dalibor ; Veľas, Martin (referee) ; Hradiš, Michal (advisor)
This thesis studies navigation and autonomous driving using convolutional neural networks. It presents main approaches to this problem used in literature. It describes theory of neural networks and imitation and reinforcement learning. It also describes tools and methods suitable for a driving system. There are two simulation driving models created using learning algorithms DAGGER and DDPG. The models are then tested in car racing simulator TORCS. 
Automatic tagging of musical compositions using machine learning methods
Semela, René ; Galáž, Zoltán (referee) ; Kiska, Tomáš (advisor)
One of the many challenges of machine learning are systems for automatic tagging of music, the complexity of this issue in particular. These systems can be practically used in the content analysis of music or the sorting of music libraries. This thesis deals with the design, training, testing, and evaluation of artificial neural network architectures for automatic tagging of music. In the beginning, attention is paid to the setting of the theoretical foundation of this field. In the practical part of this thesis, 8 architectures of neural networks are designed (4 fully convolutional and 4 convolutional recurrent). These architectures are then trained using the MagnaTagATune Dataset and mel spectrogram. After training, these architectures are tested and evaluated. The best results are achieved by the four-layer convolutional recurrent neural network (CRNN4) with the ROC-AUC = 0.9046 ± 0.0016. As the next step of the practical part of this thesis, a completely new Last.fm Dataset 2020 is created. This dataset uses Last.fm and Spotify API for data acquisition and contains 100 tags and 122877 tracks. The most successful architectures are then trained, tested, and evaluated on this new dataset. The best results on this dataset are achieved by the six-layer fully convolutional neural network (FCNN6) with the ROC-AUC = 0.8590 ± 0.0011. Finally, a simple application is introduced as a concluding point of this thesis. This application is designed for testing individual neural network architectures on a user-inserted audio file. Overall results of this thesis are similar to other papers on the same topic, but this thesis brings several new findings and innovations. In terms of innovations, a significant reduction in the complexity of individual neural network architectures is achieved while maintaining similar results.
Face superresolution from image sequence
Mezina, Anzhelika ; Rajnoha, Martin (referee) ; Burget, Radim (advisor)
Táto práce se zabývá použitím hlubokého učení neuronových sítí ke zvýšení rozlišení obrázků, které obsahují obličeje. Tato metoda najde uplatnění v různých oblastech, zejména v bezpečnosti, například, při bezpečnostním incidentu, kdy policie potřebuje identifikovat podezřelého z nahraného videa ze sledovací kamery. Cílem této práce je navrhnout minimálně dvě architektury neuronových sítí, které budou pracovat se sekvencí snímků, a porovnat je s metodami zpracování jediného snímku. Pro tento účel je také vytvořena nová trénovací množina, obsahující sekvenci snímku obličeje. Metody zpracování jednoho snímku jsou natrénované na nové množině. Dále jsou navrženy nové metody zvětšení obrázků na základě sekvence snímků. Tyto metody jsou založené na U-Net modelu, který je úspěšný v segmentaci, ale také v superrozlišení. Pro zlepšení architektury byly použity reziduální bloky a jejich modifikace, a navíc také percepční ztrátová funkce, která dovoluje vyhnout se rozmazání a získání více detailů. První čast této práce je věnovana popisu neuronových sítí a některých architektur, jejichž modifikace mohou být použity v superrozlišení. Druhá část se poté zabývá popisem metod pro zvýšení rozlišení obrazu pomocí jednoho snímku, několika snímků a videa. Ve třetí části jsou popsány navržené metody a experimenty a v poslední části porovnaná metod založených na jednom snímku a několika snímcích. Navržené metody jsou schopny získat více detailů v obraze, ale mohou produkovat artefakty. Ty lze ale poté eliminovat pomocí filtru, například Gaussova. Nové metody méně selhávají při detekci obličejů, a to je podstatné u identifikace člověka v případě incidentu.
Advanced classification of cardiac arrhythmias in ECG
Sláma, Štěpán ; Hejč, Jakub (referee) ; Ronzhina, Marina (advisor)
This work focuses on a theoretical explanation of heart rhythm disorders and the possibility of their automatic detection using deep learning networks. For the purposes of this work, a total of 6884 10-second ECG recordings with measured eight leads were used. Those recordings were divided into 5 groups according to heart rhythm into a group of records with atrial fibrillation, sinus rhythms, supraventricular rhythms, ventricular rhythms, and the last group consisted of the others records. Individual groups were unbalanced represented and more than 85 % of the total number of data are sinus rhythm group records. The used classification methods served effectively as a record detector of the largest group and the most effective of all was a procedure consisting of a 2D convolutional neural network into which data entered in the form of scalalograms (classification procedure number 3). It achieved results of precision of 91%, recall of 96% and F1-score values of 0.93. On the contrary, when classifying all groups at the same time, there were no such quality results for all groups. The most efficient procedure seems to be a variant composed of PCA on eight input signals with the gain of one output signal, which becomes the input of a 1D convolutional neural network (classification procedure number 5). This procedure achieved the following F1-score values: 1) group of records with atrial fibrillation 0.54, 2) group of sinus rhythms 0.91, 3) group of supraventricular rhythms 0.65, 4) group of ventricular rhythms 0.68, 5) others records 0.65.
Detection and measurement of electron beam in TEM images
Polcer, Simon ; Vičar, Tomáš (referee) ; Chmelík, Jiří (advisor)
This diploma thesis deals with automatic detection and measurement of the electron beam in the images from a transmission electron microscope (TEM). The introduction provides a description of the construction and the main parts of the electron microscope. In the theoretical part, there are summarized modes of illumination from the fluorescent screen. Machine learning, specifically convolution neural network U-Net is used for automatic detection of the electron beam in the image. The measurement of the beam is based on ellipse approximation, which defines the size and dimension of the beam. Neural network learning requires an extensive database of images. For this purpose, the own augmentation approach is proposed, which applies a specific combination of geometric transformations for each mode of illumination. In the conclusion of this thesis, the results are evaluated and summarized. This proposed algorithm achieves 0.815 of the DICE coefficient, which describes an overlap between two sets. The thesis was designed in Python programming language.

National Repository of Grey Literature : 132 records found   beginprevious101 - 110nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.