National Repository of Grey Literature 4 records found  Search took 0.00 seconds. 
Optimization and Refinement of XML Schema Inference Approaches
Klempa, Michal ; Holubová, Irena (advisor) ; Stárka, Jakub (referee)
Although XML is a widely used technology, the majority of real-world XML documents does not conform to any particular schema. To fill the gap, the research area of automatic schema inference from XML documents has emerged. This work refines and extends recent approaches to the automatic schema inference mainly by exploiting an obsolete schema in the inference process, designing new MDL measures and heuristic excluding of excentric data inputs. The work delivers a ready-to-use and easy-to-extend implementation integrated into the jInfer framework (developed as a software project). Experimental results are a part of the work.
Mining XML Integrity Constraints
Fajt, Stanislav ; Nečaský, Martin (advisor) ; Svoboda, Martin (referee)
The most important integrity constraints in XML are primary keys and foreign keys. In general, keys are essential in understanding both the structure and properties of data. They provide an instrument by which va- lues from a given set of attributes uniquely identify tuples in a database. As a result, keys are important to main database operations. Since XML beco- mes lingua franca for data exchange on the web, it is widely accepted as a model of real world data. Because XML documents in general can appear in any semi-structured form, structural constraints (including keys) are often imposed on the data that are to be modified or processed These constra- ints are formally defined in a schema.Unfortunately, in spite of the obvious advantages, the presence of a schema is not mandatory and many XML do- cuments are not joined with any. Consequently, no integrity constratins are specified in those documents, neither. This thesis is mainly focused on the inference of primary and foreign keys from XML documents. 1
Optimization and Refinement of XML Schema Inference Approaches
Klempa, Michal ; Holubová, Irena (advisor) ; Stárka, Jakub (referee)
Although XML is a widely used technology, the majority of real-world XML documents does not conform to any particular schema. To fill the gap, the research area of automatic schema inference from XML documents has emerged. This work refines and extends recent approaches to the automatic schema inference mainly by exploiting an obsolete schema in the inference process, designing new MDL measures and heuristic excluding of excentric data inputs. The work delivers a ready-to-use and easy-to-extend implementation integrated into the jInfer framework (developed as a software project). Experimental results are a part of the work.
Mining XML Integrity Constraints
Fajt, Stanislav ; Nečaský, Martin (advisor) ; Svoboda, Martin (referee)
The most important integrity constraints in XML are primary keys and foreign keys. In general, keys are essential in understanding both the structure and properties of data. They provide an instrument by which va- lues from a given set of attributes uniquely identify tuples in a database. As a result, keys are important to main database operations. Since XML beco- mes lingua franca for data exchange on the web, it is widely accepted as a model of real world data. Because XML documents in general can appear in any semi-structured form, structural constraints (including keys) are often imposed on the data that are to be modified or processed These constra- ints are formally defined in a schema.Unfortunately, in spite of the obvious advantages, the presence of a schema is not mandatory and many XML do- cuments are not joined with any. Consequently, no integrity constratins are specified in those documents, neither. This thesis is mainly focused on the inference of primary and foreign keys from XML documents. 1

Interested in being notified about new results for this query?
Subscribe to the RSS feed.