keywords:"Data Augmentation" - Výsledky hledání - Digitální repozitář

host :: přihlásit Digitální repozitář
		Hledej		Nový záznam		Nápověda		O repozitáři

Hlavní stránka > Výsledky hledání: keywords:"Data Augmentation"

Hledej:

Tipy pro vyhledávaní :: Rozšířené hledání

Hledej ve sbírkách:

Seřadit podle:	Zobrazit výsledky:	Výstupní formát:

	Datové sady pro síťovou bezpečnost Setinský, Jiří ; Hranický, Radek (oponent) ; Tisovčík, Peter (vedoucí práce) V oblasti síťové bezpečnosti se používají techniky strojového učení pro efektivní detekci anomálií a malwaru v síťovém provozu. Pro natrénování síťového klasifikátoru s vysokou úspěšností je potřeba kvalitní datová sada. Cílem práce je modifikace datové sady pomocí metod strojového učení za účelem zlepšení kvality datové sady, která povede na natrénování modelu s vyšší úspěšností. Datová sada je zanalyzována shlukovacím algoritmem a každý shluk je charakterizován statistickým popisem vyplývající z atributů vstupní datové sady. Statistický popis spolu s informacemi o původním klasifikátoru je použit pro výpočet skóre. Skóre slouží jako váha při modifikační fázi. Shluková analýza umožní vyfiltrovat data, která jsou důležitá pro natrénování výsledného modelu. Navržený přístup umožňuje zmírnit redundanci datové sady a nebo ji rozšířit o chybějící data. Výsledkem je modifikační framework, který je schopen redukovat datové sady nebo provádět jejich agregaci za účelem vytvoření kompaktní datové sady, která bude reflektovat aktuální síťový provoz. Na vytvořených datových sadách se podařilo natrénovat modely dosahující vyšší úspěšnosti v porovnání s existujícím řešením. Úplný záznam
	Keyboard and Keys Image Recognition Lorenc, Jan ; Lichtner, Ondrej (oponent) ; Pluskal, Jan (vedoucí práce) The goal of this thesis is to create a solution for keyboard keys recognition to automate robotic writing on keyboards. Datasets for keyboard detection in an image, character detection in an image and post-processing correction of the character detection based on various keyboard layouts were created as prerequisites for this work. This research presents several approaches towards keyboard keys detection problem and selects the most suitable one. The chosen strategy is to split the problem into 3 phases which correspond to the prepared datasets. First of all, a separate keyboard detection is run. After that, characters are recognized in the detected keyboard region. These tasks are accomplished using neural networks and Canny edge detection technique. The last phase is the post-processing of the detection results (character correction, autocompletion of undetected characters, special keys distinction etc.). The results of each phase are evaluated. The contribution of the thesis lies in the creation of the datasets for keyboard and keys detection, and novel modular and extensible solution for the recognition process that yields very promising results. Úplný záznam
	Automatic Speech Detection for VHF Channel Nováková, Mária ; Veselý, Karel (oponent) ; Szőke, Igor (vedoucí práce) A noisy environment in air traffic communication is an unavoidable problem. The communication between the control tower and the pilot should be the most reliable and effective. That is why voice activity detection is crucial for recognising the start of the speech segment of the communicants for automated systems. The speakers take turns providing information by pressing the push-to-talk button. To detect voice activity, various approaches are used. Even though these methods are effective, machine learning can easily outshine them. Neural networks are widely used in voice activity detection as well as in other areas. Properly trained models are efficient and adaptable. In this thesis, a solution for voice activity detection together with push-to-talk detection is proposed. Proposed models are evaluated and compared. The adaptation of the GPVAD approach is discussed and compared to the proposed models. Neural networks will have their chance to once again prove that they are suitable for any task. Úplný záznam
	Augmentace datasetu s využitím metod přenesení stylu Wolny, Michał ; Ligocki, Adam (oponent) ; Kratochvíla, Lukáš (vedoucí práce) Tato bakalářská práce se zaměřuje na metody augmentace datasetu a metody přenesení stylu. Z celého spektra dostupných algoritmů pro přenos stylu byly vybrány tři velmi odlišné metody, které byly implementovány a následně experimentálně použity k augmentaci datasetu. Efektivita augmentace pomocí těchto metod byla ověřena prostřednictvím provedení statistické analýzy každého nově vytvořeného datasetu ve srovnání s původním, neupraveným datasetem. Výsledky analýzy poskytují důležité informace o změnách ve statistických charakteristikách, jako je entropie, průměr, medián, rozptyl a směrodatná odchylka. Tyto informace pomohly zhodnotit účinnost a vliv použitých metod augmentace na rozšířený dataset a poskytnout důkazy o jejich potenciálu. Úplný záznam
	Imbalanced data training approaches in neural network Vicianová, Veronika ; Ředina, Richard (oponent) ; Jakubíček, Roman (vedoucí práce) This thesis deals with the research and implementation of methods that eliminate the influence of an imbalanced dataset on the learning of neural networks. Individual methods are compared with each other for different levels of imbalance. The experiments carried out in the work are also compared with the available literature and a control experiment, which was carried out without the method of eliminating the influence of an imbalanced dataset. The experiments are extended to another dataset containing the original imbalance and compared. In the theoretical section, the topic of neural networks and the problems that may occur during learning are brought up. Subsequently, convolutional networks and their optimization algorithms are presented. The thesis also contains a more detailed presentation of the issue of an imbalanced dataset, including the metrics used in experiments and approaches used to eliminate this problem. Úplný záznam
	Výzkum nových metod augmentace online písma Sigmund, Jan ; Burget, Radim (oponent) ; Zvončák, Vojtěch (vedoucí práce) Grafomotorické potíže dětí školního věku se vyznačují ztíženými projevy při psaní a kreslení a mohou vést k vývojové dysgrafii. Včasná klinická diagnostika je kritická pro poskytnutí preventivní péče. V praxi ale není proveditelná na každodenní bázi z důvodu nutnosti expertního personálu a hojnosti výskytu potíží až 30\%. Modelování strojovým učením může posloužit jako přístupný objektivní nástroj pro zhodnocení grafomotorického výkonu. Většinou ale neexistuje dostatečně velký dataset, což se projeví na horší schopnosti klasifikace. Tato diplomová práce se proto zabývá augmentací dat online písma (OP). Generování umělých vzorků se provede na základě rekombinace vlastních funkcí (IMFs) získaných \textit{empirickou modální dekompozicí} (EMD). Vypočtou se IMFs skupin intaktních (HC), v počtu 72, a s grafomotorickými potížemi (GP), čítající 94 dětí. Rozklad se provede konkrétně na časových řadách souřadnic X a Y. IMFs o stejných indexech různých subjektů se náhodně zamění, čímž vznikne signál nový. Následně se extrahují grafomotorické parametry původních a nových časových řad. Z těch se vyberou pouze prostorové, související se souřadnicemi. Nakonec bude provedena analýza a srovnání korelací parametrů obou databází. Úplný záznam
	Algoritmy pro zlepšení detekce vybraných arytmií v EKG Šandová, Hana ; Ředina, Richard (oponent) ; Novotná, Petra (vedoucí práce) Práce se zabývá navyšováním datových sad arytmií v EKG, které bývají v databázích méně často zastoupené. Teoretická část je věnována literární rešerši odborných prací, které se zabývají klasifikátory srdečních arytmií s využitím hlubokého učení a augmentací EKG signálů. V praktické části byly realizovány generátory šumu, které by po přičtení do reálně změřených signálů mohly obohatit datovou sadu. Také byla realizována metoda pro augmentaci flutteru síní. Byly vytvořeny funkce, které uměle vytvářejí signály napodobující atrioventikulární blokády II. a III. stupně. Nakonec byla snaha generovat atrioventikulární blokády II. stupně pomocí generativních kompetitivních sítí (GAN). Úspěšnost augmentace byla hodnocena přidáním různě kombinovaných syntetických dat do trénovacích množin pro EKG klasifikátor v podobě hluboké neuronové sítě. Úplný záznam
	Pedestrian Detector Domain Shift Robustness Evaluation, And Domain Shift Error Mitigation Proposal Zemčík, Tomáš This paper evaluates daytime to nighttime traffic image domain shift on Faster R-CNNand SSD based pedestrian and cyclist detectors. Daytime image trained detectors are applied on anewly compiled nighttime image dataset and their performance is evaluated against detectors trainedon both daytime and nighttime images. Faster R-CNN based detectors proved relatively robust, butstill clearly inferior to the models trained on nighttime images, the SSD based model proved noncompetitive.Approaches to the domain shift deterioration mitigation were proposed and future workoutlined. Úplný záznam
	Techniques For Avoiding Model Overfitting On Small Dataset Kratochvila, Lukas Building a deep learning model based on small dataset is difficult, even impossible. Toavoiding overfitting, we must constrain model, which we train. Techniques as data augmentation,regularization or data normalization could be crucial. We have created a benchmark with a simpleCNN image classifier in order to find the best techniques. As a result, we compare different types ofdata augmentation and weights regularization and data normalization on a small dataset. Úplný záznam
	Segmentation of multiple sclerosis lesions using deep neural networks Sasko, Dominik ; Myška, Vojtěch (oponent) ; Kolařík, Martin (vedoucí práce) This master thesis focused on automatic segmentation of Multiple Sclerosis (MS) lesions on MRI images. We tested the latest methods of segmentation using Deep Neural Networks and compared the approaches of weight initialization by transfer learning and self-supervised learning. The automatic segmentation of MS lesions is a very challenging task, primarily due to the high imbalance of the dataset (brain scans usually contain only a small amount of damaged tissue). Another challenge is a manual annotation of these lesions, as two different doctors can mark other parts of the brain as damaged and the Dice Coefficient of these annotations is approximately 0.86, which further underlines the complexity of this task. The possibility of simplifying the annotation process by automatization could improve the lesion load determination and might lead to better diagnostic of each individual patient. Our goal was to propose two techniques that use transfer learning to pre-train weights to later improve the performance of existing segmentation models. The theoretical part describes the division of artificial intelligence, machine learning and deep neural networks and their use in image segmentation. Afterwards, the work provides a description of Multiple Sclerosis, its types, symptoms, diagnosis and treatment. The practical part begins with data preprocessing. Firstly, brain scans were adjusted to the same resolution with the same voxel size. This was needed due to the usage of three different datasets, in which the scans had been created by devices from different manufacturers. One dataset also included the skull, therefore it was necessary to remove it by an FSL tool, leaving only the patient's brain in the scan. The preprocessed data were 3D scans (FLAIR, T1 and T2 modalities), which were cut into individual 2D slices and used as an input for the neural network with encoder-decoder architecture. The whole dataset consisted a total of 6,720 slices with a resolution of 192 x 192 pixels for training (after removing slices where the mask was empty). Loss function was Combo loss (combination of Dice Loss with modified Cross-Entropy). The first technique was to use the pre-trained weights from the ImageNet dataset on encoder in U-Net network, with and without locked encoder weights, respectively, and compare the results with random weight initialization. In this case, we used only the FLAIR modality. Transfer learning has proven to increase the metrics from approximately 0.4 to 0.6. The difference between encoder with and without locked weights was about 0.02. The second proposed technique was to use a self-supervised context encoder with Generative Adversarial Networks (GAN) to pre-train the weights. This network used all three modalities also with the empty slices (23,040 slices in total). The purpose of GAN was to recreate the brain image, which was covered by a checkerboard. Weights learned during this training were later loaded for the encoder to apply to our segmentation problem. The following experiment did not show any improvement, with a DSC value of 0.29 and 0.09, with and without a locked encoder, respectively. Such a decrease in performance might have been caused by the use of weights pre-trained on two distant problems (segmentation and self-supervised context encoder) or by difficulty of the task considering the hugely unbalanced dataset. Úplný záznam

Chcete být upozorněni, pokud se objeví nové záznamy odpovídající tomuto dotazu?
Přihlásit se k odběru RSS.

Digitální repozitář :: :: :: ::
Powered by v1.1.2
Spravuje

Tato stránka je dostupná také v následujících jazycích:
Česky English