National Repository of Grey Literature: 12 records found
Similarity of XML Data
Stárka, Jakub ; Holubová, Irena (advisor) ; Klímek, Jakub (referee)
In the present work we study the possibilities of reverse engineering of XML schemas. The work contains a survey of XML and commonly used languages for describing XML schemas, an overview of existing techniques for conceptual modeling and reverse engineering, and methods for evaluating mappings between XML schemas. A new method, based on analysis of the XSEM conceptual model and the subsequent creation of a decision tree, is introduced. The method makes it possible to find a mapping from XML schemas to XSEM models effectively. The work also describes a new technique for selecting the path between the mapped classes. Finally, the work contains a number of experiments that show the advantages and disadvantages of the proposed solutions.
Optimization and Refinement of XML Schema Inference Approaches
Klempa, Michal ; Holubová, Irena (advisor) ; Stárka, Jakub (referee)
Although XML is a widely used technology, the majority of real-world XML documents do not conform to any particular schema. To fill the gap, the research area of automatic schema inference from XML documents has emerged. This work refines and extends recent approaches to automatic schema inference, mainly by exploiting an obsolete schema in the inference process, designing new MDL measures, and heuristically excluding eccentric data inputs. The work delivers a ready-to-use and easy-to-extend implementation integrated into the jInfer framework (developed as a software project). Experimental results are a part of the work.
Design Pattern Support in Development Environments (Podpora návrhových vzorů ve vývojových prostředích)
Šindelář, Štěpán ; Zavoral, Filip (advisor) ; Stárka, Jakub (referee)
A design pattern is a description of communicating objects and classes that are customized to solve a general design problem in a concrete context. The main aim of design patterns is to make the design reusable and flexible. However, the flexibility provided by design patterns is usually achieved by introducing new classes into the design, and developers often do not have enough time to write textual documentation for them, so the mapping between classes and design patterns is lost. Moreover, an incorrect understanding of a specific design pattern can produce communication errors, or even software bugs. In this thesis we present the Patterns4Net project, which targets the .NET platform. With Patterns4Net, developers can annotate their classes using special attributes that document the usage of design patterns in a standardized way. This documentation is then used to verify the correctness of design pattern implementations and to generate interactive UML-like class diagrams that emphasize the connection between design patterns and concrete classes.
Governmental Linked Data and Experimental Application
Pošepný, Tomáš ; Nečaský, Martin (advisor) ; Stárka, Jakub (referee)
The aim of this thesis is to introduce the technologies of Resource Description Framework and Linked Data and to map the current state of publishing public procurement data on the Internet in the Czech Republic. The thesis focuses on problems and deficiencies in this publishing and offers a solution in the form of Government Linked Data. Along with that, an experimental application for scraping and triplifying public contracts from the E-ZAK system is developed. It also allows later extension to other systems administering public procurement. The obtained RDF dataset is mashed up with a Linked Data data source for public contracts posted in ISVZ, which also contains information about organizations from the ARES system. The mashed-up data are visualized in a simple Web application. In the conclusion, the entire thesis is summarized and a couple of hints for developers of similar applications are mentioned.
Analysis of Real-World Data and Their Exploitation
Stárka, Jakub ; Holubová, Irena (advisor) ; Krátký, Michal (referee) ; Collard, Martine (referee)
Title: Analyses of Real-World Data and Their Exploitation Author: Mgr. Jakub Stárka Department: Department of Software Engineering Supervisor: RNDr. Irena Holubová, Ph.D. Abstract: The typical optimization strategy of many data processing techniques is exploitation of the knowledge of constructs typically used in real-world applications. However, such an approach requires a repeatable, updatable and detailed analysis of a representative data set. This requirement raises a number of related problems, such as automatic crawling of the data, data extraction, schema inference, and efficient performance of analyses over a huge data volume, as well as exploitation of the results in current applications. In this thesis we describe a complex framework for performing statistical analyses of real-world documents and we propose characteristics that appropriately capture and describe features of XML documents, RDF triples and XQuery queries. Additionally, we provide experimental results over a few selected real-world data sets. Last but not least, we introduce an easily extensible tool that enables one to implement, test and compare new modules of the XML schema inference process. We describe not only the framework, but also the area of schema inference in general, including related work and open problems. Keywords:...
Universal Framework for HTML Triplification
Kadleček, Rastislav ; Stárka, Jakub (advisor) ; Klímek, Jakub (referee)
The aim of this bachelor thesis is to introduce Linked Data and Resource Description Framework technologies, and to map the current situation in the field of extracting data from HTML documents and converting the extracted data to the RDF format. The thesis introduces the software system Strigil. This system is designed to triplify data from HTML documents, but it can be extended to other file formats. The features of the system are demonstrated by triplifying data from selected Web sites. Then, some statistical information about the resulting RDF data is shown. In the conclusion, the entire thesis is summarized, and some useful hints about Web site scraping are mentioned.
