recid:458807 - Search Results - Digital Repository

guest :: login Digital Repository
		Search		Submit		Help		About

Home > Search Results: recid:458807

Search:



Search Tips :: Simple Search

Search collections:

Sort by:	Display results:	Output format:

National Repository of Grey Literature	1 records found	Search took 0.01 seconds.

Warning: Requested record does not seem to exist.

Data extraction from document scans
Macháč, Bohuslav ; Kolomazník, Jan (advisor) ; Krajíček, Václav (referee)
In this work I developed an application capable of extracting data from scanned documents. For optical character recognition, I used external OCR engine Tesseract, but it can be easily changed. I use document templates, which have informations about data areas and its data types. I tried to automatize most of the steps which are required to extract data or create new data template. User can improve or change results of these steps. For export from application I implemented components, which export data to XML, HTML or plain text. Another components can be easily added, to adapt application for various uses.

Detailed record

Interested in being notified about new results for this query?
Subscribe to the RSS feed.

Digital Repository :: :: :: ::
Powered by v1.1.2
Maintained by

This site is also available in the following languages:
Česky English