National Repository of Grey Literature 86 records found  beginprevious26 - 35nextend  jump to record: Search took 0.05 seconds. 
Web Tool for Estimation of Document Size
Dlouhá, Simona ; Dobeš, Petr (referee) ; Herout, Adam (advisor)
Aim of this thesis is to design and create web tool, which will allow users to get a text range in standard pages from a PDF file. The thesis introduces algorithms designed for retrieving statistics from a file. The tool is implemented using Python and Django framework, user interface is realized by HTML, CSS and JavaScript. Result of this thesis is a tool that provides text statistics, image statistics and chapters overview. Benefit of this tool is also more precise page counting in documents with images in comparison with other tools. 
Adobe Forms in SAP
Hás, Martin ; Rychlý, Marek (referee) ; Marušinec, Jaromír (advisor)
This master thesis was oriented to study of the development application possibilities in programming language ABAP in the information system mySAP. There were studied integration possibilities of SAP system with Adobe PDF forms and application MS Excel. Advantages and disadvantages of these two technologies were compared. The theoretical part of diploma thesis describes also the technology of SAP systems based on NetWeaver platform and the main product mySAP. The practical part describes a concrete business scenario of purchase order process where is also invoice verification list included. The result of the work is analysis, design and implementation of concrete solution for "invoice verification list" generating in SAP system. A development SAP VUT application was used for implementation and testing.
Conversion of Science Articles to Plain Text
Matička, Jiří ; Dytrych, Jaroslav (referee) ; Otrusina, Lubomír (advisor)
Purpose of this bachelor's work is a research in the area of converting scientific articles in electronic form to plain text. Main topic is the group of problematic articles with certain possible components causing non-acceptable output. Many conversion tools were investigated and the one with the required and most accurate conversion was chosen. Second part of this thesis examines the problematic of automated conversion, including creation of conversion request, forward of all articles to conversion, the conversion itself, detection of finished conversions and delivery of all converted articles. To achieve this objective, a communication principle based on client/server in conjuction with Python scripts and available needed libraries were created. From the client's point of view, it is required only to create a list of articles for conversion and then call the appropriate function (create a request). Rest of the process is taken care of automatically and the resulting text files are available for the client in a folder set beforehand.
Web Page Transformation to Vector Graphics
Nguyen, Hoang Duong ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This bachelor thesis is devoted to the problem of rendering websites with vector graphics. The goal of this thesis is design and implement an extension of the WebVector project, which allows creating the output in PDF format and rendering some specific CSS3 properties. The terms related to vector graphics and its formats are explained. This work describes the structure and features of the CSSBox library, which project WebVector works with, and other related libraries. Then some of the CSS3 properties and also their design and implementation in platform Java are detailly described.
Filtering of Texts Extracted from PDF, OCR or Web
Lehnert, Filip ; Plchot, Oldřich (referee) ; Szőke, Igor (advisor)
The objective of this thesis is to implement a set of scripts to improve the transfer of various types of documents into fully text. There appears noise and not entirely correct character conversion by converting various file formats. These scripts extracted text file cleans so that the resulting text is readable, make sense and does not contain any residues of various characters appearing by the transfer of graphs, tables, formulas, etc. The script works universally and does not require input solely by OCR tools or converting from PDF or web.
Theses Checker
Macková, Michaela ; Chlubna, Tomáš (referee) ; Milet, Tomáš (advisor)
The main goal of this work is to create an application that checks technical reports and marks all the found errors with PDF annotations. The technical documentation of this thesis breaks down the structure of a PDF file, commonly found mistakes in graduate theses, web development using the Django framework and discusses existing libraries for editing PDF documents. The resulting application is implemented in Python and is accessible as a web tool with the help of the Django framework. The developed solution recognizes six mostly typographical errors frequently found in graduate theses. The mistakes found are visually marked and the edited PDF file is then displayed directly on the web page. The resulting tool is freely available and helps students and supervisors to correct the technical reports the students create.
Text editor for seniors
Kudela, Ondřej ; Kubánková, Anna (referee) ; Komosný, Dan (advisor)
The thesis deals with the design and implementation of a simple text editor adapted for seniors in the age group of 90 years and more. The text editor is one of the applications of the operating system for the seniors, which is used to facilitate the work on the computer. The goal of this work is to build an easy to use text editor supporting basic formatting using Markdown markup language and to implement a suitable form of document management and archiving. The work describes the created source code, the translation of the Markdown language into a PDF document, text formatting, file handling, multilingual translation of the application, voice assistance, and the development of the graphical environment. The results of the work are published on the GitHub repository.
Layout-based Data Extraction from Documents
Sedláček, Martin ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This thesis deals with automated data extraction from medical reports in PDF format based on document layout analysis. The main content of the thesis is an introduction to data extraction, a comparison of existing tools and a presentation of the design and requirements of the developed tool, which will be based on the FitLayout application framework. The thesis then describes the actual implementation of the tool in Java and comments on the results achieved by the tool on real data.
Client Server Application for the Creation of File Pages for the PČR
Terbr, Filip ; Beran, Vítězslav (referee) ; Rydlo, Štěpán (advisor)
The diploma thesis is devoted to the design and implementation of a system for editing photographic documentation of the Police of the Czech Republic. The work includes an analysis of currently used technologies, a design of the client application, a design of the server part of the system and a description of the implementation of the client and server parts of the system. The resulting implementation of the server part is written in JavaScript with support for the Express.JS framework, the resulting client application is written in JavaScript using the Electron framework.
Data Extraction from PDF Documents
Bartošák, Michal ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
The work focuses on extracting information from medical records saved in PDF format, which were created by heart pacemakers during regular patient monitoring in the hospital. The result of this work is a desktop application written in Java that retrieves and analyzes data from records using PDFBox and pdf2dom libraries. The output of the application is a CSV file, which represents the acquired values in table form, as well as extracted images that are saved to a user-defined output folder. Application testing on records from three different companies proved that record extraction is highly reliable (with overall precision and recall metrics reaching almost 100 % in every test), provided that the application arguments are correctly set.

National Repository of Grey Literature : 86 records found   beginprevious26 - 35nextend  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.