Original title:
Application of Data Engineering Technologies in Bioinformatics
Authors:
SAIBOLD, Anna
Document type:
Bachelor's theses
Year:
2018
Language:
eng
Abstract:
Due to the huge amount of biological data, the variety of available data sources, and the diversity of their structure and content, data engineering technologies are required. They offer an important means of supporting the exploitation of these data. This thesis applies several data engineering steps to a particular real-world data source to demonstrate the additional benefit for utilization of the data, namely connecting it to other data sources as well as querying and analyzing it. To this end, the practical part of the thesis constructs a continuous example showing several engineering steps, comprising the development of different schemata, the creation of a database, and the mapping and integration of future heterogeneous data. Finally, processing queries against the engineered data source is compared to an online database search with regard to aspects such as time, effort, and usability. As the example shows, an engineered database can have huge benefits over online search, especially for complex queries that process data from several sources.
Keywords:
Bioinformatics; Cancer; Database; Data engineering; Disease; Gene; Information systems; Integration; Mapping; Ontology; Query; Schema; XML
Citation:
SAIBOLD, Anna. Application of Data Engineering Technologies in Bioinformatics. České Budějovice, 2018. Bachelor's thesis (Bc.). University of South Bohemia in České Budějovice. Faculty of Science
Institution: University of South Bohemia in České Budějovice
Document availability information: Fulltext is available in the Digital Repository of University of South Bohemia. Original record: http://www.jcu.cz/vskp/52998