National Repository of Grey Literature
Cleaning, extraction of text and transformation of web pages into vertical format
Švaňa, Miloš ; Otrusina, Lubomír (referee) ; Dytrych, Jaroslav (advisor)
This thesis deals with extracting text from web pages, recognizing their important content, and transforming it into vertical format, which is a suitable input for other natural language processing tasks. It analyzes the existing solution and its components, with emphasis on their shortcomings, and describes the design and implementation of a new solution based on the knowledge obtained.
