National Repository of Grey Literature 1 records found  Search took 0.05 seconds. 
Layout-based Data Extraction from Documents
Sedláček, Martin ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This thesis deals with automated data extraction from medical reports in PDF format based on document layout analysis. The main content of the thesis is an introduction to data extraction, a comparison of existing tools and a presentation of the design and requirements of the developed tool, which will be based on the FitLayout application framework. The thesis then describes the actual implementation of the tool in Java and comments on the results achieved by the tool on real data.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.