National Repository of Grey Literature 13 records found  1 - 10next  jump to record: Search took 0.00 seconds. 
A Tool for Transformation of PDF to Text
Bujok, Jonáš ; Raab, Jan (advisor) ; Falt, Zbyněk (referee)
Title: A Tool for Transformation of PDF to Text Author: Jonáš Bujok Department: Institute of Formal and Applied Linguistics (32-UFAL) Supervisor: Mgr. Jan Raab, Institute of Formal and Applied Linguistics (32-UFAL) Abstract: In this thesis we described an extraction procedure of text information from PDF (Portable Document Format) files. Thesis is focused mainly on middle-Europe languages. We designed, described and implemented program for this purpose. Besides the program and it's description the thesis contains information about PDF format object structure, it's syntax and logic necessary for proper understanding of text searching principles in PDF file. We also discussed filters, fonts and all other PDF Objects that the program need to process. This thesis also deals with methods and possibilities of improving program's functionality, speed, memory usage, reliability an universality of usage.
Agent-based modelling in economics
Hanuš, Jiří ; Raab, Jan (advisor) ; Bejček, Eduard (referee)
In this thesis, we present one view on economic modelling - Agent based modelling and Agent based computational economics, which is computation- ally intensive method that simulates interactions between entities in econ- omy instead of focusing on stable equilibria. We will provide introduction into Complex systems theory, which explains many phenomena we can see in economies and explain why these phenomena make it hard for economists to design models explaining behavior of people in real world. We will show one possible model of face-to-face interactions between consumers and firms and its implementation, where we can see whether it can sustain for a period of time in dynamic but stable state. 1
A Tool for Transformation of PDF to Text
Bujok, Jonáš ; Raab, Jan (advisor) ; Hauzar, David (referee)
Title: A Tool for Transformation of PDF to Text Author: Jonáš Bujok Department: Institute of Formal and Applied Linguistics (32-UFAL) Supervisor: Mgr. Jan Raab, Institute of Formal and Applied Linguistics (32-UFAL) Abstract: In this thesis we described an extraction procedure of text information from PDF (Portable Document Format) files. Thesis is focused mainly on middle-Europe languages. We designed, described and implemented program for this purpose. Besides the program and it's description the thesis contains information about PDF format object structure, it's syntax and logic necessary for proper understanding of text searching principles in PDF file. We also discussed filters, fonts and all other PDF Objects that the program need to process. This thesis also deals with methods and possibilities of improving program's functionality, speed, memory usage, reliability an universality of usage.
Language recognition performed on a short text sample
Zahornadský, Ján ; Raab, Jan (referee) ; Bejček, Eduard (advisor)
This paper extends the work of Cavnar and Trenkle N-gram text categorization [2], enhances the study of statistics application on document language recognition as simplier variant of categorization. Proposed program shows qualities like modular design or running on one universal character set. As an enhancement of the original work is presented an automatic text sample filtration algorithm altogether with Internet text extraction and iterative improvement for this purpose. Presented paper studies accuracy development, concentrating on short samples. Similar work was not found in available literature, as categorization (and in corollary language recognition) usually assumes long enough input. In conclusion, a discussion about using the learned data and algorithms created here to mark foreign phrases. To be specific, we study the application on Prague Dependency Treebank [8], where the foreign phrases are not recognized, only their occurences specified.
Information system for school
Valentovič, Peter ; Bejček, Eduard (referee) ; Raab, Jan (advisor)
In the present work we study creation of the school information system providing management of website. First, in this work, we deal with the implementation of system, whose advantage is its modularity. Then, the following essential part is description of the implementation of module, namely module for the automated school timetabling. In that part we explain used algorithms and heuristic's methods. Mainly, it consists of algoritms used for solving of constraint satisfaction problem and of used algorithm of local search known as tabu search.
Application for manual word alignment
Sochna, Jan ; Raab, Jan (referee) ; Pecina, Pavel (advisor)
The aim of this work was to design and implement platform-independent fast, flexible and user friendly interface for manual word alignment of bilingual texts. The new interface does not have the imperfections of existing similar tools and improves the performance of manual alignment process. It provides eg. half automatic alignment of simple texts, group operations with alignments, alignment of phrases, enables to shift one sentences along the line to improve the transparency of the alignment process in case that the length of aligned sentences differs substantially. The preceding and succeeding context of currently aligned sentences is shown in both the languages. Last but not least the tool provides the alignment performance statistics. Along with usual "row view", where the two sentences are shown in parallel in two rows, one above the other, being aligned by connections of corresponding words, there were introduced also a "matrix view", where the words in one language stand in for matrix line descriptors, the words in other language stand in for column descriptors and the alignment of two corresponding words is expressed by highlighting of the point of intersection of row and column with corresponding descriptors. It is possible to switch between the both views anytime during the alignment process.
Branch text classification
Čech, Josef ; Spousta, Miroslav (referee) ; Raab, Jan (advisor)
This thesis follows up text categorization. In the first part are described several chosen algorithms for a categorization of documents - the Bayesian model, a categorization with a neural networks and a vector model. Practice part is focused on a algorithm vector model. The vector model is based on idea of two vectors. One vector represents a pattern and second a query. In our case first vector corresponds with a category and the second one with the document. Coordinates of the vector are weights of single words in the text or in the branch depends on, which vector we think about. For comparing are possible to use several procedures like Dice coefficient similarity, Jaccard coefficient or cosine similarity. In my thesis is used cosine similarity. Computing weights is based on frequency of the term in the document and on frequency of documents, which contain the term. Relevant terms are selected on Luhn simple ideas of significance words.
Safety risks of wireless networks
Vyskočil, Vladimír ; Raab, Jan (advisor) ; Peterka, Jiří (referee)
The aim of the work is to review security risks of potencial attacks upon wireless networks.To draw up detailed list and propose suffi cient methods for defend against them. Instruction, which would help to fast recognition type of an attack on given network together with instruction which would minimalize impact of these attacks on users, should be a part of the work.
Multiplatform build system
Kouřil, Přemysl ; Spousta, Miroslav (advisor) ; Raab, Jan (referee)
Build system is an important part of software projects and almost all processes involved in software development are more or less connected to a build system. A complexity of a build system increases accrodingly to a complexity of a software project. The goal of this thesis is to introduce a proposal for a build system suitable for use in highly multiplatform projects and adapted to a specific needs of software developed in enterprise environment. This thesis first defines context and provides an overview on the topic and then analyzes theoretical aspects of key problems. Brief analysis of build tools is included in this thesis. Based on a comparison of available technologies and analysis of key problems a build system proposal is introduced for the specific class of software projects. SCons tool is used as a core of proposed build system. Thesis shall provide a developer with an apparatus strong enough so that developer is able to implement a build system that satisfies all key attributes which determine good build system.

National Repository of Grey Literature : 13 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.