National Repository of Grey Literature 3 records found  Search took 0.01 seconds. 
Nástroje a metódy pre spracovanie veľkého objemu dát zaznamenaného z dátové zbernice lietadla
Tonhajzer, Tomáš
This thesis deals with methods and technologies for storing and processing big data. Thesis contains design of tools for data storing and creation of system for processing and visualization of big data recorded from airplane data bus.
Framework for information extraction from the large language data sets
Kuboň, David ; Križ, Vincent (advisor) ; Bednárek, David (referee)
This thesis describes the FAFEFI program that focuses on n-gram and skip-gram extraction from large data sets. The thesis presents two different approaches to passing input data to the program. It also describes the design of data structures for n-gram and skip-gram representation within computer memory, the algorithm of n-gram and skip-gram extraction, memory-friendly options of saving extracted data and their final composition into output feature vectors. It also offers a variety of extra functions such as line filter and line modifier and a great deal of configurable parameters ranging from in-file separators to formatting the names of output files. Moreover, the program provides a differentiation in its activity by enabling saving data just after extraction from the train set and brings tools for cluster parallelization. Powered by TCPDF (www.tcpdf.org)
Optimization of the Distributed I/O Subsystem of the k-Wave Project
Vysocký, Ondřej ; Hrbáček, Radek (referee) ; Jaroš, Jiří (advisor)
This thesis deals with an effective solution of parallel writing of variable amounts of data on the Lustre file system. The work will be used by the k-Wave project designed for time domain acoustic and ultrasound simulations. Since the simulation is computationally and data intensive, the project requires to be implemented with libraries for parallel computig (Open MPI) and large data processing (HDF5) and it must run on a supercomputer. The application is implemented in C and uses previously mentioned libraries. The proper settings of the Lustre file system leads to the peak write bandwith of 2.5 GB/s that corresponds to a speedup factor of 5 compared to the reference settings. The data aggregation improved the write bandwidth by a factor of 3 compared to a naive version. Here, the achieved I/O bandwidth for certain block sizes hits the limits of the Anselm I/O subsytem (3GB/s).

Interested in being notified about new results for this query?
Subscribe to the RSS feed.