National Repository of Grey Literature: 6 records found
Web Page Segmentation Algorithms Based on Clustering
Lengál, Tomáš ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This report deals with the segmentation of web pages, which is an important discipline of information extraction. In the first part, we describe several general ways to implement it. We then introduce the Box Clustering Segmentation method, which takes a slightly different approach to segmentation. In the second half, we describe the implementation of this method as part of the FITLayout framework and its final testing.
New Web Page Segmentation Methods
Malaník, Michal ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
The aim of this work is to introduce a new vision-based web page segmentation method. The method is based on the very popular VIPS segmentation algorithm, which tries to represent the segmented web document in the same way as it is perceived by a user viewing it in a web browser. Compared to the VIPS algorithm, our method includes some optimizations for modern websites, especially for documents written in HTML 5. We also deal with the implementation of the proposed method using the FITLayout framework.
Web Interface for a Document Analysis System
Mirská, Olga ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This Bachelor's thesis describes the design and implementation of a web interface for an internet document analysis system based on the existing desktop application FITLayout Framework. The work is divided into a theoretical and a practical part. The theoretical part describes the technologies used for creating dynamic web pages. The practical part is dedicated to describing the design and the implementation of particular functions of the FITLayout web application. In conclusion, the test results of the web application's functions and their comparison with the functions of FITLayout Framework are presented.
