keywords:"klasifikace textu" - Search Results - Digital Repository


	Actual Events Tracker Odstrčilík, Martin ; Otrusina, Lubomír (referee) ; Kouřil, Jan (advisor) The goal of this master's thesis project was to develop an application for tracking current events in the users' surrounding area. The application should allow users to view events, create new events, and add comments to existing ones. Beyond the implementation of the application, the project analyzes the problem itself. The analysis includes a comparison with existing solutions and a survey of available technologies and frameworks suitable for the implementation. Another part of the work describes the theory behind data classification, which is used internally for event and comment analysis. The work also covers the design of the application, including the user interface, software architecture, database, communication protocol, and data classifiers. The main part of the project, the implementation, is described afterwards. The work closes with a summary of the whole process and some ideas for enhancing the application in the future.
	Adaptive RSS Reader Luža, Jindřich ; Otrusina, Lubomír (referee) ; Smrž, Pavel (advisor) The purpose of this bachelor's thesis is to explore the possibility of enhancing a common RSS reader with an extension that allows the user to filter an RSS feed by classifying its items into groups according to their content. It discusses problems of classification in general and of text classification in particular. Further, it presents the theoretical aspects of the RSS format that need to be considered when implementing an RSS reader module and a prototype of the module. Finally, testing of the classifier used is reported.
	Artificial Intelligence Document Classification Molnár, Ondřej ; Kačic, Matej (referee) ; Třeštíková, Lenka (advisor) This paper deals with document classification using artificial intelligence. It describes the principles of classification and machine learning. It also introduces AI methods and presents the Naive Bayes classification method in detail. It provides a practical implementation of the classifier in MS Office and discusses other possible extensions.
	Porovnání open-source nástrojů pro strojové učení Poliakova, Yevheniia Poliakova, Y. Comparison of open-source tools for machine learning. Thesis. Brno: Mendel University in Brno, 2022. This work is devoted to research on accessible open-source artificial intelligence. The thesis describes a selected list of available artificial intelligence tools and the use of these tools for specific tasks. The main contribution of the work is the comparison of open-source tools using experiments focused on inductively controlled (supervised, classification) knowledge acquisition from large volumes of text and data. These experiments will be performed using selected open-source tools. The result of the work will be a conclusion about the advantages and disadvantages of these platforms, their characteristics in solving specific problems, and recommendations for choosing a platform according to the assigned task or data.
	Assessment and implementation of text data preprocessing in neural network models Ratnasari, Febiyanti In the realm of text data processing, text preprocessing has traditionally played a significant role. However, with the growing prominence of neural network models and novel representations of textual data, the importance of text preprocessing has been relatively understated. To address this, the present research endeavors to investigate the potential benefits of employing a composite of multiple text data preprocessing techniques in conjunction with a neural network-based text processing model.
	Text Classification Methods in the Context of Web Pages Trstenský, Patrik ; Bartík, Vladimír (referee) ; Burget, Radek (advisor) This work deals with the issue of text classification in the context of websites. It examines available classification methods and their accuracy over web page plain text. It deals with constructing a dataset for training these methods for a specific domain. We obtain data for creating the dataset from publicly available websites that utilize RDF documents defined in HTML code. The work concludes with the creation of two datasets for two different domains and the use of these datasets for training models and testing their accuracy.
	Crude Oil Price Forecast based on Text News Skalický, Jan ; Bojar, Ondřej (advisor) ; Žabokrtský, Zdeněk (referee) For crude oil price forecast, there is a whole range of algorithms. In this thesis we bring out a new perspective on this issue and introduce our project COPF. Using a maximum entropy classifier, we try to predict the change in crude oil price from text information available on the Internet. We are taking advantage of the knowledge of experts in the field. As a part of the thesis, we tested and improved COPF precision. We have found out that this approach poses a lot of interesting problems. In the current state, the precision of our prediction surpassed the baseline but for further development, it is necessary to obtain more data sources. Our algorithm has never been regarded as a self-standing method but it may nicely complement numerical algorithms.
	Popularity Meter Hajič, Jan ; Bojar, Ondřej (advisor) ; Popel, Martin (referee) Having the possibility of automatically tracking a person's popularity in the newspapers is an idea appealing not just to those in the media spotlight. While sentiment (subjectivity) analysis is a rapidly growing subfield of computational linguistics, no data from the news domain are yet available for Czech. We have therefore started building a manually annotated polarity corpus of sentences from Czech news texts; however, these texts have proven themselves rather unwieldy for such processing. We have also designed a classifier which should be able to track popularity based on this corpus; the classifier has been tested on a corpus of product reviews of domestic appliances and some introductory testing has been done on the nascent news corpus. As a model, we simply extract a unigram polarity lexicon from the data. We then use three related methods for identifying lemma polarity and a number of simple filters for feature selection. On the domestic appliance data, our simplest model has achieved results comparable to the state of the art; however, the properties of Czech news texts and preliminary results hint that a more linguistically oriented approach might be preferable.
	Analýza textových používateľských hodnotení vybranej skupiny produktov Valovič, Roman This work focuses on the design of a system that identifies frequently discussed product features in product reviews, summarizes them, and displays them to the user in terms of sentiment. The work deals with natural language processing, with a specific focus on the Czech language. The reader will be introduced to the methods of text preprocessing and their impact on the quality of the analysis results. The most discussed product features are identified by cluster analysis using the K-Means algorithm, where we assume that sufficiently internally homogeneous clusters will represent the individual features of the products. A new area explored in this work is the representation of documents using the word embeddings technique and the potential of using the resulting vector space as input for machine learning algorithms.
	Extraction of Semantic Relations from Text Pospíšil, Milan ; Schmidt, Marek (referee) ; Smrž, Pavel (advisor) Many semi-structured documents exist today that we want to convert to a structured form. The goal of this work is to create a system that makes this task more automated. This can be a difficult problem because most of these documents are not generated by a computer, so the system has to tolerate differences. Some semantic understanding is also needed, which is why we restrict ourselves to the domain of meeting minutes documents.