National Repository of Grey Literature 148 records found  1 - 10nextend  jump to record: Search took 0.00 seconds. 
Real-Time Face Tracking
Ermak, Aleksei ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This bachelor thesis focuses on the issue of face tracking in real time. In the beginning, this work describes the existing methods of object tracking and face detection. The following part of this thesis concentrates on the design, implementation and testing of the convolutional neural network, which was proved as the effective solution for the face tracking issue. In addition to this, the implemented network is compared to those existing methods. The last part of the thesis describes the optimization of the designed network using OpenVINO toolkit provided by Intel.
Active Learning for OCR
Kohút, Jan ; Kolář, Martin (referee) ; Hradiš, Michal (advisor)
The aim of this Master's thesis is to design methods of active learning and to experiment with datasets of historical documents. A large and diverse dataset IMPACT of more than one million lines is used for experiments. I am using neural networks to check the readability of lines and correctness of their annotations. Firstly, I compare architectures of convolutional and recurrent neural networks with bidirectional LSTM layer. Next, I study different ways of learning neural networks using methods of active learning. Mainly I use active learning to adapt neural networks to documents that the neural networks do not have in the original training dataset. Active learning is thus used for picking appropriate adaptation data. Convolutional neural networks achieve 98.6\% accuracy, recurrent neural networks achieve 99.5\% accuracy. Active learning decreases error by 26\% compared to random pick of adaptations data.
Pedestrian Attribute Analysis
Studená, Zuzana ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This work deals with obtaining pedestrian information, which are captured by static, external cameras located in public, outdoor or indoor spaces. The aim is to obtain as much information as possible. Information such as gender, age and type of clothing, accessories, fashion style, or overall personality are obtained using using convolutional neural networks. One part of the work consists of creating a new dataset that captures pedestrians and includes information about the person's sex, age, and fashion style. Another part of the thesis is the design and implementation of convolutional neural networks, which classify the mentioned pedestrian characteristics. Neural networks evaluate pedestrian input images in PETA, FashionStyle14 and BUT Pedestrian Attributes datasets. Experiments performed over the PETA and FashionStyle datasets compare my results to various convolutional neural networks described in publications. Further experiments are shown on created BUT data set of pedestrian attributes.
Automatic Industrial Quality Control from Image
Kruták, Martin ; Hradiš, Michal (referee) ; Zemčík, Pavel (advisor)
The goal of this thesis is to create overall, automatic and non-contact quality control of a pellet. The issue is divided into two separate parts. The first part deals with precise dimensional measuring of pellet - its length and head diameter so that it is precise and reasonably fast. Precise measuring is achieved with help of algorithms which achieve the sub-pixel precision by polynomial approximation of the edges extracted from the image gradients. The second part deals with the defects of a pellet. Detecting defects like longitudinal furrows or skirt cuts is achieved with convolutional neural networks. The measurement modules work with the resulting precision up to 0.025 mm in case of length measuring and up to 0.01 mm in case of head diameter measuring. In case of defect detections, neural network shows very high classification success rate. The contribution of this thesis is a presentation of innovative approaches in automatic quality control of pellets with use of neural networks and a demonstration of its usage in real manufacturing process.
Convolutional Networks for Lip Reading
Kadleček, Josef ; Kišš, Martin (referee) ; Hradiš, Michal (advisor)
This thesis deals with current methods for automatic speech recognition and lip reading via neural networks. Furthermore it deals with similarities in the architectures of neural networks for audio and visual data and available datasets in the field of audiovisual automatic speech recognition. The main contribution of this thesis is set of experiments comparing different changes in neural network architecture and its impact on results. The thesis includes an implementation of a system for automatic speech recognition from audio (CER: 12.6 %) and visual (CER: 57,7 %) data. The architectures of both systems are based on features extraction via convolutional networks followed by recurrent layers LSTM, another layer of convolutions and loss function CTC. 
Playing Gomoku with Neural Networks
Slávka, Michal ; Kolář, Martin (referee) ; Hradiš, Michal (advisor)
Táto práca sa zaoberá použitím algoritmu AlphaZero pre hru Gomoku. AlphaZero je založený na spätnoväzbnom učení a k trénovaniu nemusia byť využité žiadne existujúce datasety. Trénovanie prebieha iba na hrách algoritmu samého so sebou. AlphaZero používa algoritmus na prehľadávanie stromu, pre zlepšenie stratégie. Na vylepšnej stratégii sa následne trénuje neurónová sieť. Tento prístup bol úspešný v hrách proti existujúcim algoritmom. Generovanie trénovacích dát vysokej kvality si vyžaduje veľa výpočetne náročných iterácií trénovania a generovania dát. Experimenty ukázali, že každou iteráciou sa algoritmus zlepšuje, čo naznačuje, že je ešte miesto na zlepšenie, ale množstvo iterácií  nedostačovalo na to, aby bol poriadne natrénovaný.
Pedestrian Identification
Jurča, Jan ; Špaňhel, Jakub (referee) ; Hradiš, Michal (advisor)
This thesis deals with pedestrian identification from video sequence based on person, face and gait recognition. For person and face recognition are used pretrained networks. While for gait recognition is implemented and compared many different networks. Final pedestrian recognition is based on multimodal fusion realized by neural network. For the purpose of the work was created dataset, along with a set of tools that allow its almost automatic creation.
Holistic License Plate Recognition Based on Convolution Neural Networks
Le, Hoang Anh ; Hradiš, Michal (referee) ; Špaňhel, Jakub (advisor)
Main goal of this work was to create a holistic license plate reader, with an emphasis on achieving the highest possible accuracy on low quality images. Combination of convolutional and recurrent neural networks was designed and implemented, with usage of LSTM and CTC, where the inputs are cut-outs from the entire license plate. Competitive networks were also implemented to compare results. Networks were compared on a total of 4 datasets and the results were, that my design has achieved the best results with a recognition accuracy of 97.6%.
Vehicle License Plate Detection and Recognition Software
Masaryk, Adam ; Hradiš, Michal (referee) ; Špaňhel, Jakub (advisor)
The aim of this bachelor thesis is to design and develop software that can detect and recognize license plates from images. The software is divided into 3 parts - license plates detection, detector output processing and license plates characters recognition. We decided to implement detection and recognition using modern methods using convolutional neural networks.
Detection of Vehicle License Plates in Video
Líbal, Tomáš ; Hradiš, Michal (referee) ; Herout, Adam (advisor)
This thesis deals with preparation of training dataset and training of convolutional neural network for licence plate detection in video. Darknet technology was used for detection, specifically the YOLOv3-tiny neural network model. The solution was focused on the most accurate detection and the smallest number of false positives per image, thus minimizing overall model error. Dataset was prepared from existing freely available datasets, from the dataset provided by the GRAPH@FIT research group, and from self-annotated images created from downloaded YouTube videos. Furthermore, this dataset has been processed using data augmentation, extending it to twice the size. The YOLO Mark tool was used to create annotations. An ROC curve was used to visualize the detection success. Created solution reaches minimum total error 10,849%. Part of the solution is already mentioned dataset.

National Repository of Grey Literature : 148 records found   1 - 10nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.