National Repository of Grey Literature 98 records found  beginprevious41 - 50nextend  jump to record: Search took 0.00 seconds. 
Web Information Extraction
Pekárek, Filip ; Galamboš, Leo (advisor) ; Kopecký, Michal (referee)
In the present work we suggest and test new process of web information extraction. Proposed method consider DOM tree of the web page including it's visual cues. Basic and the rst part is semantic parts extraction of a page using VIPS algorithm. Next step is validation and eventual modication of gained information based on the local context. Final part is classication of analyzing page into predened classes using got facts. Set of critics implemented by congurable instances of neural networks determine the classes.
Dynamics of dissolved organic matter depending on the activity of ants in coniferous temperate forest
Vacířová, Anna ; Jandová, Kateřina (advisor) ; Cepáková, Šárka (referee)
5 Abstrakt Soil organic matter is the largest reservoir of carbon in terrestrial ecosystems, and its functionally important component is dissolved organic matter. Its composition and changes in this composition during the vegetation season affect the vegetation, the activity of the organisms in the soil and the formation of soil organic matter. Local bio-hotspots such as anthills create spatial heterogeneity of the soil solution composition. In my work I analyzed cold water (CWE) and hot water (HWE) soil extracts, which I took in different distances from the anthill, in three soil horizons (O, A, B) three times during one vegetation season. I measured contents of carbon (C), nitrogen (N), C: N ratio and the parameter SUVA254 and SUVA280. The labile C and N content varied with the distance from the anthill. The C and N content with the distance from the anthill at O horizon increases, decreases in A and B. C: N ratio was the highest in the A horizon, but during the season the differences were not so significant. The smallest content of C and N was in April, the highest in July. The SUVA parameter in O and B decreases with distance from anthill, in A on the other hand it has risen. Ants have an effect not only on the quantity but also on the quality of the DOM, at the same time creating heterogeneity of DOM...
Methods of Data Extraction from the Web
Perina, Lukáš ; Křivka, Zbyněk (referee) ; Burget, Radek (advisor)
The purpose of this bachelor thesis is to design an architecture and subsequent implementation of an application designed for data extraction (web scraping) from web documents. Unlike conventional methods, it is an extraction based on defining data types and regular expressions of requested elements. Extraction is executed in such a manner, where it is not necessary to know the detailed structure of given web document and the possibility of using just one definition to detect requested elements on different web pages. Algorithm is able to achieve overall accuracy of 85,51% and recall 80,28%. This approach can reduce the time required for analysis of web pages significantly and not to take the structure of the code as a determining factor while creating web scraping requests.
Automated Testing of Embedded Web Applications
Dufek, Ondřej ; Rychlý, Marek (referee) ; Burget, Radek (advisor)
This thesis focuses on automatic testing of embedded web applications, specifically on their user interfaces. I solved this problem on an already implemented user interface. I designed and implemented an automated test environment based on the Selenium tool. The created solution facilitates the development of automatic tests of the selected product and enables regression testing of a new version of the product with a minimum time cost. The resulting solution will ensure greater product quality and make work easier for manual testers.
XML Databases for Dictionary Data Management
Samia, Michel ; Dytrych, Jaroslav (referee) ; Smrž, Pavel (advisor)
The following diploma thesis deals with dictionary data processing, especially those in XML based formats. At first, the reader is acquainted with linguistic and lexicographical terms used in this work. Then particular lexicographical data format types and specific formats are introduced. Their advantages and disadvantages are discussed as well. According to previously set criteria, the LMF format has been chosen for design and implementation of Python application, which focuses especially on intelligent merging of more dictionaries into one. After passing all unit tests, this application has been used for processing LMF dictionaries, located on the faculty server of the research group for natural language processing. Finally, the advantages and disadvantages of this application are discussed and ways of further usage and extension are suggested.
On-Line Advertisement System
Zito, Andrej ; Bidlo, Radek (referee) ; Blatný, Petr (advisor)
Tato práce se zabýva návrhem a realizací on-line inzertního systému. Cílem bylo navrhnout a vytvořit funkční aplikaci poskytující základní úlohy pro koncové uživatele umožňující jim přidávat, přehlížet a vyhledávat inzeráty, registrovat se, posílat příspevky a soukromé vzkazy. Systém byl implementován za použití následujících technologií: XHTML, PHP, MySQL, CSS, JavaScript, DOM and AJAX.
Web Page Layout Facilities in JavaScript
T.Kovács, Gregor ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
The aim of this work is to design and implement applications for the creation of web page layout facilities using JavaScript. The work includes the descriptions of the available methods of object positioning using the CSS given possibilities, the CSS 2.1 standard, and the difficulties of object positioning using CSS. Further, it includes the analysis of how the object placement is solved in the Java programming language using grid based layout managers GridLayout and GridBagLayout. Based on the obtained knowledge, designs are created for the solving of object placement in the creation of web pages using the grid principle. The object placement is solved by defining new HTML attributes for position determination, and also by creating a graphical editor for object placement. All the solutions are implemented using JavaScript.
Web Browser Automation
Bastl, Vojtěch ; Polčák, Libor (referee) ; Burget, Radek (advisor)
This work deals with the automation of a web browser - the tools that allow programmatic control of the program for browsing the web pages. First, it discusses the existing solutions with focus on the tools from the Selenium Suite family and PhantomJS. Further, the internal representation of the web pages in the Gecko and WebKit browser engines is discussed. The work then focuses on the web browser application interface available for client-side scripting. The relevant standards are discussed as well. The core part of the thesis is dedicated to the design and implementation of a tool that allows to control a browser using the Selenium WebDriver tool and to extract data about the targert web page. The work presents an internal architecture, configuration files and the application interface of the designed tool. The topic of extracting detailed data about the page and its transformation to a unified structured description is covered as well. Finally, the performed unit tests and tests on real web pages are described.
Grid Layout Implementation in a CSS Rendering Engine
Novák, Ondřej ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
The goal of this thesis is to study the architecture of the CSSBox experimental rendering engine and propose a way to expand library with new options of grid layout content. The opening chapters contain an overview of problematics and subsequently a solution is proposed. The proposed architecture is implemented and tested. The conclusion is dedicated to evaluation of results and options futher development are outlined.

National Repository of Grey Literature : 98 records found   beginprevious41 - 50nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.