Original title: ScraperWiki Tutorial
Authors: Levine, Thomas
Document type: Papers
Conference/Event: Big Clean 2012, Prague (CZ), 2012-11-03
Year: 2012
Language: eng
Abstract: The objective of the workshop, or better hackathon, was to get the data into a structured format, and join it with data from another sources – together with an overview and showing by example what is possible with scraping. Thomas identified targets for web scraping and navigating the complexity of different types of web pages and introduced that in a few half-hour-long and hour-long modules that catered to different audiences.
Keywords: BigClean; data cleaning; data theory; scraping; structured data; BigClean; sbírání dat; strukturované data; teorie dat; čištění dat
Rights: This work is protected under the Copyright Act No. 121/2000 Coll.; License: Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Czech Republic

Institution: National Library of Technology (web)

Permalink: http://www.nusl.cz/ntk/nusl-127015


The record appears in these collections:
Culture > Libraries > National Library of Technology
Conference materials > Papers
 Record created 2012-11-19, last modified 2023-12-11


If you can´t see the document in your browser, save it to your PC and open it in a suitable application.
  • Export as DC, NUŠL, RIS
  • Share