National Repository of Grey Literature 128 records found  beginprevious107 - 116nextend  jump to record: Search took 0.01 seconds. 
Image based smoke and fire detection
Ďuriš, Denis ; Burda, Karel (referee) ; Přinosil, Jiří (advisor)
This diploma thesis deals with the detection of fire and smoke from the image signal. The approach of this work uses a combination of convolutional and recurrent neural network. Machine learning models created in this work contain inception modules and blocks of long short-term memory. The research part describes selected models of machine learning used in solving the problem of fire detection in static and dynamic image data. As part of the solution, a data set containing videos and still images used to train the designed neural networks was created. The results of this approach are evaluated in conclusion.
Dance Recognition from Audio Recordings
Pavlín, Tomáš ; Čech, Jan (advisor) ; Moudřík, Josef (referee)
We propose a CNN-based approach to classify ten genres of ballroom dances given audio recordings, five latin and five standard, namely Cha Cha Cha, Jive, Paso Doble, Rumba, Samba, Quickstep, Slow Foxtrot, Slow Waltz, Tango and Viennese Waltz. We utilize a spectrogram of an audio signal and we treat it as an image that is an input of the CNN. The classification is performed independently by 5-seconds spectrogram segments in sliding window fashion and the results are then aggregated. The method was tested on following datasets: Publicly available Extended Ballroom dataset collected by Marchand and Peeters, 2016 and two YouTube datasets collected by us, one in studio quality and the other, more challenging, recorded on mobile phones. The method achieved accuracy 93.9%, 96.7% and 89.8% respectively. The method runs in real-time. We implemented a web application to demonstrate the proposed method.
Deep Learning for OCR in GUI
Hamerník, Pavel ; Špaňhel, Jakub (referee) ; Lysek, Tomáš (advisor)
Optical character recognition (OCR) has been a topic of interest for many years. It is defined as the process of digitizing a document image into a sequence of characters. Despite decades of intense research, OCR systems with capabilities to that of human still remains an open challenge. In this work there is presented a design and implementation of such system, which is capable of detecting texts in graphical user interfaces.
Convolutional Networks for Historic Text Recognition
Vešelíny, Peter ; Kolář, Martin (referee) ; Kišš, Martin (advisor)
This thesis deals with text line recognition of historical documents. Historical texts dating back to the 17th - 19th centuries are written in fraktur typeface. The character recognition problem is solved using neural network architecture called sequence-to-sequence . This architecture is based on encoder-decoder model and contains attention mechanism. In this thesis a dataset, from texts originated from German archiv called Deutsches Textarchiv , was created. This archive contains 3 897 different German books that have available transcripts and corresponding images of pages. The created dataset was used to train and experiment with the proposed neural network. During the experiments, several convolutional models, hyperparameters and the effects of positional embedding were investigated. The final tool can recognize characters with accuracy 99,63 %. The contribution of this work is the~mentioned dataset and neural network, which can be used to recognize historical documents.
Smartphone Game Using Recognition of Face Features
Skoták, Jiří ; Szőke, Igor (referee) ; Herout, Adam (advisor)
This master's thesis focuses on smartphone game for iOS, which uses recognition of face features and other information, which can be obtained from a smartphone's camera and sensors. This work describes a few approaches for real-time face detection and then introduces and compares possibilities for such task on iOS. Moreover, the thesis contains a draft of the final game and its levels. The game showcases various technologies in its levels such as object detection, processing an image color and others. Finally, the thesis introduces the final form of the game that is released and available on the App Store.
Weapon Detection in an Image
Debnár, Pavol ; Drahanský, Martin (referee) ; Dvořák, Michal (advisor)
This thesis is focused on the topic of firearms detection in images. In the theoretic section, the explanation of the term firearm is covered, along with the definition of the most prevalent firearm categories. Then the concept of image noise and the ways it can hinder image detection is covered, along  with ways of reducing it. Next, algorithms of image detection are introduced - first those which operate on the basis of neural nets - such as Convolutional Neural Nets and Single Shot Multibox Detection. The next section discusses classic algorithms of object detection such as HOG+SVM and SURF. After that, information on the used libraries and software is provided. The experimental part covers the designed algorithm and database. For detection, the HOG+SVM, SURF and SSD algorithms were used. All the algorithms are tested on the database and, if possible, on video. A final evaluation is provided, along with possible future development options.
Defect detection on fiber materials using machine learning
Lang, Matěj ; Richter, Miloslav (referee) ; Honec, Peter (advisor)
Cílem této diplomové práce je automatizace detekce vad ve vláknitých materiálech. Firma SILON se již přes padesát let zabývá výrobou jemné vaty z recyklovaných PET lahví. Tato vata se následně používá ve stavebnictví, automobilovém průmyslu, ale nejčastěji v dámských hygienických potřebách a dětských plenách. Cílem firmy je produkovat co nejkvalitnější výrobek a proto je každá dávka testována v laboratoři s několika přísnými kritérii. Jednám z testů je i množství vadných vláken, jako jsou zacuchané smotky vláken, nebo nevydloužená vlákna, která jsou tvrdá a snadno se lámou. Navrhovaný systém sestává ze snímací lavice fungující jako scanner, která nasnímá vzorek vláken, který byl vložen mezi dvě skleněné desky. Byla provedena série testů s různým osvětlením, která ověřovala vlastnosti Rhodaminu, který se používá právě na rozlišení defektů od ostatních vláken. Tyto defekty mají zpravidla jinou molekulární strukturu, na kterou se barvivo chytá lépe. Protože je Rhodamin fluorescenční barvivo, je možné ho například pod UV světlem snáze rozeznat. Tento postup je využíván při manuální detekci. Při snímání kamerou je možno si vypomoci filtrem na kameře, který odfiltruje excitační světlo a propustí pouze světlo vyzářené Rhodaminem. Součástí výroby skeneru byla i tvorba ovládacího programu. Byla vytvořena vlastní knihovna pro ovládání motoru a byla upravena knihovna pro kameru. Oba systém pak bylo možno ovládat pomocí jednotného GUI, které zajišťovalo pořizování snímku celé desky. Pomocí skeneru byla nasnímána řada snímků, které bylo třeba anotovat, aby bylo možné naučit počítač rozlišovat defekty. Anotace proběhla na pixelové úrovni; každý defekt byl označen v grafickém editoru ve speciální vrstvě. Pro rozlišování byla použita umělá neuronová síť, která funguje na principu konvolucí. Tento typ sítě je navíc plně konvoluční, takže výstupem sítě je obraz, který by měl označit na tom původním vadné pixely. Výsledky naučené sítě jsou v práci prezentovány a diskutovány. Síť byla schopna se naučit rozeznávat většinu defektů a spolehlivě je umí rozeznat a segmentovat. Potíže má v současné době s detekcí rozmazaných defektů na krajích zorného pole a s defekty, jejichž hranice není tolik zřetelná na vstupních obrazech. Nutno zmínit, že zákazník má zájem o kompletní řešení scanneru i s detekčním softwarem a vývoj tohoto zařízení bude pokračovat i po závěru této diplomové práce.
Holistic License Plate Recognition Based on Convolution Neural Networks
Le, Hoang Anh ; Hradiš, Michal (referee) ; Špaňhel, Jakub (advisor)
Main goal of this work was to create a holistic license plate reader, with an emphasis on achieving the highest possible accuracy on low quality images. Combination of convolutional and recurrent neural networks was designed and implemented, with usage of LSTM and CTC, where the inputs are cut-outs from the entire license plate. Competitive networks were also implemented to compare results. Networks were compared on a total of 4 datasets and the results were, that my design has achieved the best results with a recognition accuracy of 97.6%.
Detection of Vehicle License Plates in Video
Líbal, Tomáš ; Hradiš, Michal (referee) ; Herout, Adam (advisor)
This thesis deals with preparation of training dataset and training of convolutional neural network for licence plate detection in video. Darknet technology was used for detection, specifically the YOLOv3-tiny neural network model. The solution was focused on the most accurate detection and the smallest number of false positives per image, thus minimizing overall model error. Dataset was prepared from existing freely available datasets, from the dataset provided by the GRAPH@FIT research group, and from self-annotated images created from downloaded YouTube videos. Furthermore, this dataset has been processed using data augmentation, extending it to twice the size. The YOLO Mark tool was used to create annotations. An ROC curve was used to visualize the detection success. Created solution reaches minimum total error 10,849%. Part of the solution is already mentioned dataset.
Detection of Wanted People in Video
Bažout, David ; Musil, Petr (referee) ; Beran, Vítězslav (advisor)
The aim of this work is to create a software tool for searching of wanted people in video recordings from surveillance cameras. Wanted people are identified to the system using multiple facial photos. The output consists of information on the occurrence of wanted persons in specific frames. The problem consists of face detection and its subsequent identification task. Experiments with existing approaches on appropriate datasets provide relevant comparisons of method performance under different conditions. Appropriate methods and their optimal settings for this particular task are chosen according to the results of the experiments. The thesis also deals with the design of suitable architecture, research of existing libraries implementing the tested methods and other ways of optimizing the calculation. The result is the implementation of a user application that meets the specified parameters. The application's functionality has been tested on the own dataset simulating real-world conditions.

National Repository of Grey Literature : 128 records found   beginprevious107 - 116nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.