National Repository of Grey Literature
Text Layout Analysis in Historical Documents
Palacková, Bianca ; Hradiš, Michal (referee) ; Kodym, Oldřich (advisor)
The goal of this thesis is to design and implement an algorithm for text layout analysis in historical documents. The problem was solved with a neural network, specifically the Faster R-CNN architecture. A dataset of 6,135 images of historical newspapers was used for training and testing. Four neural network models were trained for the thesis: one each for detecting words, headings, and text regions, and one for detecting words based on their position in a line. The outputs of these models were processed to determine the text layout of the input image. A modified F-score metric was used for evaluation; by this metric, the algorithm reached an accuracy of almost 80%.
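The abstract does not say how the F-score was modified. For orientation only, the sketch below computes the standard, unmodified F-score for box detections, which is not the thesis's metric. The function names, the greedy matching strategy, and the 0.5 IoU threshold are all assumptions made for illustration.

```python
from typing import List, Tuple

# A box is (x_min, y_min, x_max, y_max); this layout is an assumption.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def detection_f_score(
    predicted: List[Box],
    ground_truth: List[Box],
    iou_threshold: float = 0.5,  # hypothetical threshold, not from the thesis
) -> float:
    """Standard F-score with greedy one-to-one matching of boxes."""
    unmatched = list(ground_truth)
    true_pos = 0
    for p in predicted:
        # Pick the still-unmatched ground-truth box that overlaps p the most.
        best = max(unmatched, key=lambda g: iou(p, g), default=None)
        if best is not None and iou(p, best) >= iou_threshold:
            unmatched.remove(best)
            true_pos += 1
    false_pos = len(predicted) - true_pos
    false_neg = len(unmatched)
    denom = 2 * true_pos + false_pos + false_neg
    # Convention chosen here: nothing to detect and nothing predicted scores 1.0.
    return 2 * true_pos / denom if denom else 1.0


predicted = [(0, 0, 10, 10), (20, 20, 30, 30), (50, 50, 60, 60)]
ground_truth = [(0, 0, 10, 10), (21, 21, 31, 31)]
score = detection_f_score(predicted, ground_truth)
```

In this made-up example two of the three predictions match a ground-truth box and no ground-truth box is missed, so precision is 2/3, recall is 1, and the F-score is 0.8.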
