National Repository of Grey Literature 2 records found  Search took 0.00 seconds. 
Filtering of Texts Extracted from PDF, OCR or Web
Lehnert, Filip ; Plchot, Oldřich (referee) ; Szőke, Igor (advisor)
The objective of this thesis is to implement a set of scripts to improve the transfer of various types of documents into fully text. There appears noise and not entirely correct character conversion by converting various file formats. These scripts extracted text file cleans so that the resulting text is readable, make sense and does not contain any residues of various characters appearing by the transfer of graphs, tables, formulas, etc. The script works universally and does not require input solely by OCR tools or converting from PDF or web.
Filtering of Texts Extracted from PDF, OCR or Web
Lehnert, Filip ; Plchot, Oldřich (referee) ; Szőke, Igor (advisor)
The objective of this thesis is to implement a set of scripts to improve the transfer of various types of documents into fully text. There appears noise and not entirely correct character conversion by converting various file formats. These scripts extracted text file cleans so that the resulting text is readable, make sense and does not contain any residues of various characters appearing by the transfer of graphs, tables, formulas, etc. The script works universally and does not require input solely by OCR tools or converting from PDF or web.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.