National Repository of Grey Literature
The Most Frequent Word n-Grams
Holec, Matúš ; Szőke, Igor (referee) ; Smrž, Pavel (advisor)
This thesis deals with the design and implementation of an efficient system for extracting word n-grams from texts. The system is based on batch processing and is therefore able to process large text corpora. The first part presents the principles of existing n-gram extraction methods. The next part describes the implemented system and an approach to accelerating it by parallelizing the batch processing. The last part compares the efficiency of available implementations with that of the designed system, and compares the time complexity of the sequential and parallelized approaches.
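As a rough illustration of the batch-based idea summarized in the abstract, the sketch below counts word n-grams one batch of lines at a time and merges the per-batch counts. It is not the thesis's implementation: the function names (`word_ngrams`, `batches`, `count_batch`, `most_frequent_ngrams`), the whitespace tokenization, the lowercasing, and the choice not to let n-grams cross line boundaries are all assumptions made for this example.

```python
from collections import Counter
from itertools import islice


def word_ngrams(tokens, n):
    """Yield consecutive word n-grams (as tuples) from a list of tokens."""
    return zip(*(tokens[i:] for i in range(n)))


def batches(lines, batch_size):
    """Split an iterable of lines into lists of at most batch_size lines."""
    it = iter(lines)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            return
        yield batch


def count_batch(batch, n):
    """Count the n-grams of one batch (assumed: whitespace tokens, lowercased)."""
    counts = Counter()
    for line in batch:
        tokens = line.lower().split()
        counts.update(word_ngrams(tokens, n))
    return counts


def most_frequent_ngrams(lines, n=2, batch_size=1000, top=10):
    """Merge per-batch counts and return the `top` most frequent n-grams."""
    total = Counter()
    for batch in batches(lines, batch_size):
        total.update(count_batch(batch, n))
    return total.most_common(top)


if __name__ == "__main__":
    sample = ["a b a b", "a b c"]
    print(most_frequent_ngrams(sample, n=2, batch_size=1, top=3))
```

Because each call to `count_batch` depends only on its own batch, the batches could in principle be handed to separate worker processes and the resulting counters merged afterwards. That is one plausible reading of the parallelization the abstract mentions, not a description of how the thesis does it.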
