National Repository of Grey Literature 26 records found  previous7 - 16next  jump to record: Search took 0.00 seconds. 
Adaptive RSS Reader
Luža, Jindřich ; Otrusina, Lubomír (referee) ; Smrž, Pavel (advisor)
Purpose of this balcheor thesis is posibility to enhance common RSS reader by extension, which allowing user filter RSS feed depends on that's classification by content to groups.There is discussed problems in common classification and in text classification. Forth, there is reveal teoretical aspect of RSS format, which is needed to be considered in implementation of RSS reader module and prototype of module. At last, testing of used classifier is stated here.
Artificial Intelligence Document Classification
Molnár, Ondřej ; Kačic, Matej (referee) ; Třeštíková, Lenka (advisor)
This paper deals with document classification using artificial intelligence. It describes the principles of classification and machine learning. It also introduces AI methods and presents Naive Bayes classification method in detail. Provides practical implementation of the classifier in MS Office and discusses other possible extensions.
Porovnání open-source nástrojů pro strojové učení
Poliakova, Yevheniia
Poliakova, Y. Comparison of open-source tools for machine learning. Thesis. Brno: Mendel University in Brno, 2022. This work is devoted to the research of accessible open source artificial intelligence. The thesis describes a selected list of available artificial intelligence tools and the use of these tools for specific tasks. The main contribution of the work is the comparison of open-source tools using experiments focused on inductively controlled (supervised, classification) knowledge acquisition from large volumes of text and data. These experiments will be performed using selected open-source tools. The result of the work will be a conclusion about the advantages and disadvantages of the already mentioned platforms, their characteristics in solving specific problems and recommendations for choosing a platform according to the assigned task or data.
Assessment and implementation of text data preprocessing in neural network models
Ratnasari, Febiyanti
In the realm of text data processing, text preprocessing has traditionally played a significant role. However, with the growing prominence of neural network models and novel representations of textual data, the importance of text preprocessing has been relatively understated. To address this, the present research endeavors to investigate the potential benefits of employing a composite of multiple text data preprocessing techniques in conjunction with a neural network-based text processing model.
Text Classification Methods in the Context of Web Pages
Trstenský, Patrik ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This work deals with the issue of text classification in the context of websites. It examines available classification methods and their accuracy over web page plain text. It deals with constructing a dataset for training these methods for a specific domain. We obtain data for creating the dataset from publicly available websites that utilize RDF documents defined in HTML code. The conclusion of the work consists of the creation of two datasets for two different domains. Furthermore, the use of these datasets for training models and testing of their accuracy.
Crude Oil Price Forecast based on Text News
Skalický, Jan ; Bojar, Ondřej (advisor) ; Žabokrtský, Zdeněk (referee)
For crude oil price forecast, there is a whole range of algorithms. In this thesis we bring out a new perspective on this issue and introduce our project COPF. Using a maximum entropy classifier, we try to predict the change in crude oil price from text information available on the Internet. We are taking advantage of the knowledge of experts in the field. As a part of the thesis, we tested and improved COPF precision. We have found out that this approach poses a lot of interesting problems. In the current state, the precision of our prediction surpassed the baseline but for further development, it is necessary to obtain more data sources. Our algorithm has never been regarded as a self-standing method but it may nicely complement numerical algorithms.
Popularity Meter
Hajič, Jan ; Bojar, Ondřej (advisor) ; Popel, Martin (referee)
Having the possibility of automatically tracking a person's popularity in the newspapers is an idea appealing not just to those in the media spotlight. While sentiment (subjectivity) analysis is a rapidly growing subfield of computational linguistics, no data from the news domain are yet available for Czech. We have therefore started building a manually annotated polarity corpus of sentences from Czech news texts; however, these texts have proven themselves rather unwieldy for such processing. We have also designed a classifier which should be able to track popularity based on this corpus; the classifier has been tested on a corpus of product reviews of domestic appliances and some introductory testing has been done on the nascent news corpus. As a model, we simply extract a unigram polarity lexicon from the data. We then use three related methods for identifying lemma polarity and a number of simple filters for feature selection. On the domestic appliance data, our simplest model has achieved results comparable to the state of the art, however, the properties of Czech news texts and preliminary results hint a more linguistically oriented approach might be preferrable.
Analýza textových používateľských hodnotení vybranej skupiny produktov
Valovič, Roman
This work focuses on the design of a system that identifies frequently discussed product features in product reviews, summarizes them, and displays them to the user in terms of sentiment. The work deals with the issue of natural language processing, with a specific focus on Czech languague. The reader will be introduced the methods of preprocessing the text and their impact on the quality of the analysis results. The identification of the mainly discussed products features is carried out by cluster analysis using the K-Means algorithm, where we assume that sufficiently internally homogeneous clusters will represent the individual features of the products. A new area that will be explored in this work is the representation of documents using the Word embeddings technique, and its potential of using vector space as input for machine learning algorithms.
Extraction of Semantic Relations from Text
Pospíšil, Milan ; Schmidt, Marek (referee) ; Smrž, Pavel (advisor)
Today exists many semi-structured documents, whitch we want convert to structured form. Goal of this work is create a system, that make this task more automatized. That could be difficult problem, because most of these documents are not generated by computer, so system have to tolerate differences. We also need some semantic understanding, thats why we choose only domain of meeting minutes documents.
Comparison of approaches to text classification
Knížek, Jan ; Hana, Jiří (advisor) ; Vidová Hladká, Barbora (referee)
The focus of this thesis is short text classification. Short text is the prevailing form of text on e-commerce and review platforms, such as Yelp, Tripadvisor or Heureka. As the popularity of the online communication is increasing, it is becoming infeasible for users to filter information manually. It is therefore becoming more and more important to recog- nise the relevant information in text. Classification of reviews is especially challenging, because they have limited structure, use informal language, contain a high number of errors and rely heavily on context and common knowledge. One of the possible appli- cations of machine learning is to automatically filter data and show users only relevant pieces of information. We work with restaurant reviews from Yelp and aim to predict their usefulness. Most restaurants have relatively many reviews, yet only few are truly useful. Our objective is to compare machine learning methods for predicting usefulness. 1

National Repository of Grey Literature : 26 records found   previous7 - 16next  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.