National Repository of Grey Literature 11 records found  previous11 - 11  jump to record: Search took 0.01 seconds. 
The possibilities of automated extraction of data from publicly available sources
Jelínek, Martin ; Maryška, Miloš (advisor) ; Pavlíčková, Jarmila (referee)
The theoretical part of this work describes some options that can be used to retrieve data from different information sources. It also discusses the possibility of automatic data processing and tools and technologies that can be used to do this. Mainly technologies which can be used to store acquired data and to analyze them. This includes description of different types of databases or data mining methods. The practical part of this work is devoted to the creation of an application for automatic downloading of articles from news sites. The application allows you to download articles from selected news sites, and save parsed article text into a file and additional information to the database. The purpose of the application is mainly collecting data that can be used for further analysis . The application allows searching in downloaded articles using keywords, create topic groups from articles and monitor articles history. This allows for example to monitor possible differences between articles whitch belongs to the same topic and were downloaded from different news sites or to monitor progress in some topic. Another motivation is archiving of old articles for further analysis, because the articles on news sites are constantly changing.

National Repository of Grey Literature : 11 records found   previous11 - 11  jump to record:
Interested in being notified about new results for this query?
Subscribe to the RSS feed.