National Repository of Grey Literature 12 records found  1 - 10next  jump to record: Search took 0.01 seconds. 
Semantic relation extraction from unstructured data in the business domain
Rampula, Ilana ; Pecina, Pavel (advisor) ; Kuboň, Vladislav (referee)
Text analytics in the business domain is a growing field in research and practical applications. We chose to concentrate on Relation Extraction from unstructured data which was provided by a corporate partner. Analyzing text from this domain requires a different approach, counting with irregularities and domain specific attributes. In this thesis, we present two methods for relation extraction. The Snowball system and the Distant Supervision method were both adapted for the unique data. The methods were implemented to use both structured and unstructured data from the database of the company. Keywords: Information Retrieval, Relation Extraction, Text Analytics, Distant Supervision, Snowball
Content-based exploration of unstructured data
Čech, Přemysl ; Lokoč, Jakub (advisor) ; Barthel, Kai Uwe (referee) ; Gudmundsson, Gylfi Thor (referee)
Effective analysis, searching and browsing throughout arbitrary multimedia collections is still a challenging task. To perform a search among multimedia objects, first, a similarity model has to be defined. Such a model establishes methods describing how the content of individual objects is processed and how key features and descriptors, that are used for modeling similarity between objects, are formed. This task is not trivial since there can be many ways of determining how to comprehend the content of multimedia data. Furthermore, with the growing size of contemporary database collections, multimedia retrieval and exploration are extremely computationally intensive. Hence, researchers investigate support indexing structures that can evaluate similarity queries and can respond to user's queries in almost real-time even on datasets counting billions of objects. Another very important aspect of a retrieval system is the user interface for defining queries as well as presenting retrieved results. A multimedia system should offer various inputs for formulating user's queries, especially for situations in which a user cannot provide an ideal query example. Finally, a well- arranged and easy to read interface for visualization of retrieved results is essential for the success of a multimedia exploration and...
Design and Implementation of System for Aggregations of Real Estate Offers in the Czech Republic
Drobník, Jakub ; Kučera, Jan (advisor) ; Chlapek, Dušan (referee)
The diploma thesis deals with the design and implementation of software for aggregations of real estate offers in the Czech Republic. The aim of the thesis is to create a system which aggregates the data of real estate offers from web pages. This thesis consists of two basic parts. The context of creating the system is described in the first part. The author discusses ways to retrieve data from websites - especially the extraction of data using automated robots - in the first part of the thesis. The design and implementation of the system are described in the second part. The author and sponsor define requirements for the system in the second part of the thesis. The outcome of this thesis is a prototype that aggregates data from real estate portals into the prepared database. The main contribution of the thesis is an example of a possible approach that can aggregate data from a particular market segment and put it into the database.
Application of text mining methods for analysis of users movie reviews
Palatínus, Vojtěch ; Matějka, Martin (advisor) ; Novotný, Ota (referee)
The topic of this thesis is to define the challenges while working with the unstructured data. It focuses, specifically, on a transformation between unstructured and structured data using text mining methods and bringing the closer view on so-called Big Data phenomenon. The goal of this thesis is to introduce problems that occur when working with unstructured data, to show their transformation to structured data format using text mining methods and to perform analysis on user reviews published on the website of The Internet Movie Database from the mined data. The aim of this thesis is to familiarize the reader with the unstructured data and on the example demonstrate how to use text mining methods for mining relevant information from this type of data.
Semantic relation extraction from unstructured data in the business domain
Rampula, Ilana ; Pecina, Pavel (advisor) ; Kuboň, Vladislav (referee)
Text analytics in the business domain is a growing field in research and practical applications. We chose to concentrate on Relation Extraction from unstructured data which was provided by a corporate partner. Analyzing text from this domain requires a different approach, counting with irregularities and domain specific attributes. In this thesis, we present two methods for relation extraction. The Snowball system and the Distant Supervision method were both adapted for the unique data. The methods were implemented to use both structured and unstructured data from the database of the company. Keywords: Information Retrieval, Relation Extraction, Text Analytics, Distant Supervision, Snowball
Usage of unstructured data in Business Intelligence
Rakhmanova, Malika ; Šperková, Lucie (advisor) ; Karkošková, Soňa (referee)
The aim of the thesis is to identify the main trends that are occurring in the market of Business Intelligence and related to unstructured data, to describe the possibilities for integrating unstructured data, to clarify what the impact on the company have the results that can be obtained using these solutions and how generally incorporate an analysis of unstructured data into BI. Another aim is to show the current situation of processing unstructured data on the example of BI system. The thesis is divided into several parts. First part is describing of the Business Intelligence area and the basic components of Business Intelligence, as well as identifying market trends. Then, there is the next part: separating the data into structured and unstructured. Here is the part about how you can access and analyse unstructured data and what is their place in BI systems. This is the end of a block of unstructured data and the beginning of a description of the enhanced version of BI. Finally, the current market situation and BI tools, which include unstructured data, are introduced. This section provides an overview of how BI tools approach to analyse unstructured data. Existed literature, professional and freely available Internet resources are used for writing the work. The purpose is to serve as a source of information for quickly orienting in the current situation, to serve as a guide to the world of BI solutions and to show potential users what are the options and functionality of these BI solutions.
Open data in the agrarian sector
Martinec, Radomil ; Jarolímek, Jan (advisor) ; Ivo, Ivo (referee)
This Master's Thesis Open Data in the Agrarian Sector is interested in the usage of open data in the particular sector of the national economy. The aim of the search is to approach the theoretical background of the issue. The emphasis is predominantly put on the definition of open data and other related subjects. The practical part covers the use of open data in the Czech agrarian sector. The main goal is to analyze the contemporary state of open data use of its selected institutions. After the analysis outcomes, there follows the practical demonstration of selected data opening in this sector (Grant Recipient List). The project prototype is focused on the transfer of selected data in the open format, its publication and visualization possibilities. The last part deals with the suggestion for the possibility of opening data for Czech agrarian institutions; its evaluation from some of the perspectives is also included. The conclusion provides the general evaluation of the contemporary open data use and the summary of the given suggestion on their spread.
The analyses of unstructured content from publicly available social media by Watson
Šverák, Martin ; Molnár, Zdeněk (advisor) ; Hawlová, Kateřina (referee)
This graduate thesis deals with the analysis of unstructured data from public social media. In particular, it deals with the analysis of data from social media of Vodafone Czech Republic a.s. This thesis is divided into two parts. The first part provides theoretical background for the second part. Therefore, the first part describes social media, structured and unstructured data and tools which are used for analysing of unstructured data. In the second part, tool Watson is used for the analysis of publicly available data. Then, methodology is designed to control the analysis process and subsequently this methodology used in the formation of the pilot application that has to verify the functionality of unstructured data by tool Watson. The results of the analysis are in the conclusion. The main benefits of this thesis are the development of a pilot application of Watson and the verification of its functionality. The pilot application cannot be equated with a complete analysis that can be done by Watson. But this pilot application may work as a demonstration of Watson's functionalities.
Aplikace metod předzpracování při dolování znalostí z textových dat
Kotíková, Michaela
The diploma thesis focuses on unstructured textual data preprocessing in relation to text mining. A series of experiments oriented to text mining is designed and carried out. The effect of different techniques of textual data preprocessing to the entire text mining process and its results is evaluated based on output of the experiments.
Competitive analysis of leading ICT companies on the Czech market
Dvořák, Oskar ; Feige, Tomáš (advisor) ; Molnár, Zdeněk (referee)
This thesis deals with the field of Competitive Intelligence in relation to the possibilities of application of its methods and tools for competitive analysis of the market environment using modern virtual social networks. Theoretical part focuses on the characteristics of the market environment of ICT companies by using Porter's analysis and then it is focused on the description of selected tools and methods used to processing unstructured data and social networks analysis. The practical part is based on a real project which ran from early March 2013 at IBM Company. Practical part demonstrates current possibilities of information technology in the field of Competitive Intelligence.

National Repository of Grey Literature : 12 records found   1 - 10next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.