GIJC15 has ended
Welcome to GIJC15! We have more than 160 panels, workshops, and special events planned for the conference. Be sure to register so you can create a personalized schedule and better network with your colleagues. Unless marked Limited Capacity, sessions are open and you are free to come and go. 
Saturday, October 10 • 14:30 - 15:30
Data Track: Python for Scraping 3

Sign up or log in to save this to your schedule and see who's attending!

An introduction to webscraping with Python: This two-part, hands-on workshop will teach basic newsroom programming concepts using the Python language. We'll cover how to deconstruct a common reporting task -- gathering a table of data from a public website -- and assemble a solution from useful Python libraries that you can use again and again.

Prerequisites: Attendees should be familiar with HTML and the command line and be comfortable with databases and SQL. If you've ever written a string function in Excel ("=left(A2,5)"), you'll be fine.

Python has to be installed at your laptop before the training. Somebody can help you at the “Data Pub” on Thursday.


avatar for Adriana Homolova

Adriana Homolova

data journalist, KRO-NCRV
Data journalist @ Pointer / KRO-NCRVSleeps long hours and writes scrapers for fun.Listens to Boney M and so should you! https://www.youtube.com/watch?v=oR6eKmqSEa0
avatar for Tom Meagher

Tom Meagher

Deputy Managing Editor, The Marshall Project
Tom Meagher is the deputy managing editor of the Marshall Project, a nonprofit, non-partisan news organization covering crime and justice in America. A veteran reporter and editor, he previously led an interactive team for the Digital First Media newspaper chain and was the data editor... Read More →

Saturday October 10, 2015 14:30 - 15:30
Messanin (Data Hands-On)

Attendees (0)