National Repository of Grey Literature: 2 records found
Apache Hadoop as analytics platform
Brotánek, Jan ; Novotný, Ota (advisor) ; Kerol, Valeria (referee)
This diploma thesis focuses on integrating the Hadoop platform into an existing data warehouse architecture. The theoretical part describes the properties of Big Data together with the methods and models used to process it. The Hadoop framework, its components and its distributions are discussed, as are the components that enable end users, developers and analysts to access a Hadoop cluster. The practical part presents a case study of batch data extraction from the existing data warehouse on the Oracle platform using the Sqoop tool, transformation of the data in the relational structures of the Hive component, and loading of the results back into the original source. Data compression and query efficiency under various storage formats are also discussed. The quality and consistency of the manipulated data are checked during all phases of the process. Part of the practical section discusses ways of capturing and storing stream data: the Flume tool is used to capture the stream data, which are then transformed with the Pig tool. The purpose of implementing the process is to move part of the data and its processing from the existing data warehouse to the Hadoop cluster. To that end, a process for integrating the existing data warehouse with the Hortonworks Data Platform and its components was designed.
Evaluation of CASE tools for database design
Brotánek, Jan ; Chlapek, Dušan (advisor) ; Bruckner, Tomáš (referee)
This thesis focuses on designing a process for evaluating data modeling CASE tools and tools that support database design. It maps the requirements currently placed on these tools and, based on them, defines evaluation criteria. The process is tested on a set of commercial and open-source CASE tools. The tools are evaluated against the defined criteria, and the product that best meets the requirements is recommended. After verification on this set of tools, the process is evaluated, modified and published as a separate attachment.
