Original title: Strigil: A framework for data extraction
Authors: Zvirinský, Peter
Document type: Papers
Conference/Event: Big Clean 2012, Prague (CZ), 2012-11-03
Year: 2012
Language: eng
Abstract: Data scraping is a way to gather and integrate data from different data sources. In this presentation, we describe Strigil, a framework for automated screen scraping. It lets users define custom scraping scripts in an intuitive graphical user interface and provides a solution for scalable, distributed scraping.
Keywords: BigClean; data collection; data scraping; Open Data; Strigil
Rights: This work is protected under the Copyright Act No. 121/2000 Coll.; License: Creative Commons Attribution 3.0 Czech Republic

Institution: National Library of Technology

Permalink: http://www.nusl.cz/ntk/nusl-126849



The record appears in these collections:
Culture > Libraries > National Library of Technology
Conference materials > Papers
 Record created 2012-11-09, last modified 2023-12-11

