Original title:
ScraperWiki Tutorial
Authors:
Levine, Thomas Document type: Papers Conference/Event: Big Clean 2012, Prague (CZ), 2012-11-03
Year:
2012
Language:
eng Abstract:
The objective of the workshop, or better hackathon, was to get the data into a structured format, and join it with data from another sources – together with an overview and showing by example what is possible with scraping. Thomas identified targets for web scraping and navigating the complexity of different types of web pages and introduced that in a few half-hour-long and hour-long modules that catered to different audiences.
Keywords:
BigClean; data cleaning; data theory; scraping; structured data; BigClean; sbírání dat; strukturované data; teorie dat; čištění dat
Rights: This work is protected under the Copyright Act No. 121/2000 Coll.; License: Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Czech Republic