National Repository of Grey Literature 45 records found  beginprevious26 - 35next  jump to record: Search took 0.00 seconds. 
Acceleration of Vector and Cryptographic Operations on x86-64 Platform
Šlenker, Samuel ; Martinásek, Zdeněk (referee) ; Balík, Miroslav (advisor)
The aim of this thesis was to study and subsequently process a comparison of older and newer SIMD processing units of modern microprocessors on the x86-64 platform. The thesis provides an overview of the fastest computations of vector operations with matrices and vectors, including corresponding source codes. Furthermore, the thesis is focused on authenticated encryption, specifically on block cipher AES operating in Galois Counter Mode, and on a discussion of possibilities of instruction sets for cryptographic support.
Acceleration of Object Detection Using Classifiers
Juránek, Roman ; Kälviäinen, Heikki (referee) ; Sojka, Eduard (referee) ; Zemčík, Pavel (advisor)
Detekce objektů v počítačovém vidění je složítá úloha. Velmi populární a rozšířená metoda pro detekci je využití statistických klasifikátorů a skenovacích oken. Pro učení kalsifikátorů se často používá algoritmus AdaBoost (nebo jeho modifikace), protože dosahuje vysoké úspěšnosti detekce, nízkého počtu chybných detekcí a je vhodný pro detekci v reálném čase. Implementaci detekce objektů je možné provést různými způsoby a lze využít vlastnosti konkrétní architektury, pro urychlení detekce. Pro akceleraci je možné využít grafické procesory, vícejádrové architektury, SIMD instrukce, nebo programovatelný hardware. Tato práce představuje metodu optimalizace, která vylepšuje výkon detekce objektů s ohledem na cenovou funkci zadanou uživatelem. Metoda rozděluje předem natrénovaný klasifikátor do několika různých implementací, tak aby celková cena klasifikace byla minimalizována. Metoda je verifikována na základním experimentu, kdy je klasifikátor rozdělen do předzpracovací jednotku v FPGA a do jednotky ve standardním PC.
Lifting Scheme Cores for Wavelet Transform
Bařina, David ; Kälviäinen, Heikki (referee) ; Sojka, Eduard (referee) ; Zemčík, Pavel (advisor)
Práce se zaměřuje na efektivní výpočet dvourozměrné diskrétní vlnkové transformace. Současné metody jsou v práci rozšířeny v několika směrech a to tak, aby spočetly tuto transformaci v jediném průchodu, a to případně víceúrovňově, použitím kompaktního jádra. Tohle jádro dále může být vhodně přeorganizováno za účelem minimalizace užití některých prostředků. Představený přístup krásně zapadá do běžně používaných rozšíření SIMD, využívá hierarchii cache pamětí moderních procesorů a je vhodný k paralelnímu výpočtu. Prezentovaný přístup je nakonec začleněn do kompresního řetězce formátu JPEG 2000, ve kterém se ukázal být zásadně rychlejší než široce používané implementace.
Efficient Implementation of High Performance Algorithms on Multi-Core Processors
Tomečko, Lukáš ; Bidlo, Michal (referee) ; Jaroš, Jiří (advisor)
This thesis describes the process of parallelization and vectorization of fluid simulation using OpenMP library and Intel compiler. Various approaches were tried e.g. cache blocking, data sorting and data reorganization. By combining the best of them, final application preformed 11.4 times faster than the original one, using 16 cores. Benchmarks show that used algorithms are not suitable for vectorization.
Compilation of OpenCL Applications for Embedded Systems
Šnobl, Pavel ; Čekan, Ondřej (referee) ; Hruška, Tomáš (advisor)
This master's thesis deals with the support for compilation and execution of programs written using OpenCL framework on embedded systems. OpenCL is a system for programming heterogeneous systems comprising processors, graphic accelerators and other computing devices. But it also finds usage on systems composed of just one computing unit, where it allows to write parallel programs (task and data parallelism) and work with hierarchical system of memories. In this thesis, various available open source OpenCL implementations are compared and one selected is then integrated into LLVM compiler infrastructure. This compiler is generated as a part of toolchain provided by application specific instruction set architecture processor developement environment called Codasip Studio. Designed and implemented are also optimizations for architectures with SIMD instructions and VLIW architectures. The result is tested and demonstrated on a set of testing applications.
Processing units of last generation processors and their utilization
Šlenker, Samuel ; Pavlíček, Tomáš (referee) ; Balík, Miroslav (advisor)
The aim of this thesis was to study and subsequently process the differences between the older instruction sets and newer instruction sets, to specify the benefits of the individual extensions, to compare the way of computations of the individual SIMD processing units and to compare the implementation of these processing units in Intel and AMD companies. Part of this work are two theoretical introductions to laboratory tasks.
SIMD Instructions Support in LLVM Compiler
Šnobl, Pavel ; Hynek, Jiří (referee) ; Masařík, Karel (advisor)
This bachelor thesis deals with support of automatic vectorization of code in the LLVM compilation framework and with extension of Codix processor model of SIMD instructions. As a result, LLVM is able to create reports about the process of auto-vectorization and it is possible to use special pragma directives to provide the compiler with additional information for optimizations of programs. Also a way of providing information about architectures of processors created using development environment Codasip Framework, needed for more effective vectorization, is introduced and implemented. Finally a set of integer vector instructions and related new registers for Codix is chosen and added to the model.
Generating Code of Optimised Mathematical Operations
Beneš, Vojtěch ; Horáček, Petr (referee) ; Čermák, Martin (advisor)
Bachelor's thesis deals with creating a simple programming language for working with mathematical operations. Main point of the thesis is to create a compiler of this language, which is using MMX technology to generate instructions of an assembler code. The optimized code generation is based on modified algorithm of context generation.
Search of Corresponding Objects in a Pair of Images
Vrbenský, Andrej ; Herman, David (referee) ; Orság, Filip (advisor)
This thesis is aimed on area of stereoscopy. At first, there is  some space dedicated to stereoscopic theory, primarily to seaching of corresponding objects in a stereo image pair. Main attention is given to template matching methods, which are based on intesity comparison. Then we try to optimize these methods with SIMD SSE instructions and run some tests with image examples. These methods are implemented in C++ and also in assembly language.
GPU Image Processing Library
Čermák, Michal ; Španěl, Michal (referee) ; Smrž, Pavel (advisor)
This work is concerned with architecture of recent Nvidia graphics cards and application programming interface CUDA. That is used to create accelerated image processing library. It place emphasis on testing performance gain compassion with high optimized and used OpenCv library.

National Repository of Grey Literature : 45 records found   beginprevious26 - 35next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.