National Repository of Grey Literature 7 records found  Search took 0.01 seconds. 
Content Based Photo Search
Dvořák, Pavel ; Beran, Vítězslav (referee) ; Španěl, Michal (advisor)
This thesis covers design and practical realization of a tool for quick search in large image databases, containing from tens to hundreds of thousands photos, based on image similarity. The proposed technique uses various methods of descriptor extraction, creation of Bag of Words dictionaries and methods of storing image data in PostgreSQL database. Further, experiments with the implemented software were carried out to evaluate the search time effectivity and scaling possibilities of the design solution.
Duplicate Text Identification
Pekař, Tomáš ; Kouřil, Jan (referee) ; Smrž, Pavel (advisor)
The aim of this work is to design and implement a system for duplicate text identification. The application should be able to index documents and also searching documents at index. In our work we deal with preprocessing documents, their fragmentation and indexing. Furthermore we analyze methods for duplicate text identification, that are also linked with strategies for selecting substrings. The thesis includes a description of the basic data structures that can be used to index n-grams.
Automatically Updated Bibliography
Valo, Boris ; Škoda, Petr (referee) ; Smrž, Pavel (advisor)
This paper describes the development of application for automatically updated bibliography. Nowadays, many Internet users search informations they need, this is important especially in sets of scientific publications and articles. The aim of this thesis is convenient tool for users to create their own portal. This is achieved by storing documents and their subsequent search using ElasticSearch. Retrieval is made by Boolean queries and additional search using similarity search tool MoreLikeThis. At the end of this thesis is described the way of testing and evaluation of retrieval.
Design of search engine for modern needs
Maršálek, Tomáš ; Palovská, Helena (advisor) ; Strossa, Petr (referee)
In this work I argue that field of text search has focused mostly on long text documents, but there is a growing need for efficient short text search, which has different user expectations. Due to this reduced data set size requirements different algorithmic techniques become more computationally affordable. The focus of this work is on approximate and prefix search and purely text based ranking methods, which are needed due to lower precision of text statistics on short text. A basic prototype search engine has been created using the researched techniques. Its capabilities were demonstrated on example search scenarios and the implementation was compared to two other open source systems representing currently recommended approaches for short text search problem. The results show feasibility of the implemented prototype regarding both user expectations and performance. Several options of future direction of the system are proposed.
Automatically Updated Bibliography
Valo, Boris ; Škoda, Petr (referee) ; Smrž, Pavel (advisor)
This paper describes the development of application for automatically updated bibliography. Nowadays, many Internet users search informations they need, this is important especially in sets of scientific publications and articles. The aim of this thesis is convenient tool for users to create their own portal. This is achieved by storing documents and their subsequent search using ElasticSearch. Retrieval is made by Boolean queries and additional search using similarity search tool MoreLikeThis. At the end of this thesis is described the way of testing and evaluation of retrieval.
Content Based Photo Search
Dvořák, Pavel ; Beran, Vítězslav (referee) ; Španěl, Michal (advisor)
This thesis covers design and practical realization of a tool for quick search in large image databases, containing from tens to hundreds of thousands photos, based on image similarity. The proposed technique uses various methods of descriptor extraction, creation of Bag of Words dictionaries and methods of storing image data in PostgreSQL database. Further, experiments with the implemented software were carried out to evaluate the search time effectivity and scaling possibilities of the design solution.
Duplicate Text Identification
Pekař, Tomáš ; Kouřil, Jan (referee) ; Smrž, Pavel (advisor)
The aim of this work is to design and implement a system for duplicate text identification. The application should be able to index documents and also searching documents at index. In our work we deal with preprocessing documents, their fragmentation and indexing. Furthermore we analyze methods for duplicate text identification, that are also linked with strategies for selecting substrings. The thesis includes a description of the basic data structures that can be used to index n-grams.

Interested in being notified about new results for this query?
Subscribe to the RSS feed.