National Repository of Grey Literature: 6 records found
Advanced Web Crawler
Činčera, Jaroslav ; Jirák, Ota (referee) ; Trchalík, Roman (advisor)
This Master's thesis describes the design and implementation of an advanced web crawler. The crawler is configurable by the user and browses the web according to specified parameters; it can also acquire and evaluate the content of web pages. It is configured by creating projects that consist of different types of steps. The user can create simple actions, such as downloading a page or submitting a form, or build larger and more complex projects.
Automatic Web Form Processing
Zdráhal, Petr ; Kolář, Dušan (referee) ; Burget, Radek (advisor)
This thesis deals with web form automation. It gives a short introduction to the HTML and XHTML markup languages and their facilities for defining forms. A short overview of the HTTP protocol and the XML markup language follows, and a method of describing forms in XML is proposed. The thesis describes a tool for creating an XML file from an HTML document and a data sender tool that uses a CSV data file and an XML form description. It then describes the algorithms used in the implementation. The conclusion presents the achieved results and possible future improvements.
Data Extraction from Dynamic Web Pages
Puna, Petr ; Kunc, Michael (referee) ; Burget, Radek (advisor)
This work gives a brief overview of technologies for representing and obtaining data on the WWW and describes selected web data extraction tools. It designs a new tool for obtaining pages generated by filling in web forms; the tool lets its user define the data on such pages, extracts those data, and offers them in an XML format suitable for further machine processing.
