Program Content - Class 3
Agendas will be posted on this page at least one week before the start of each module, and any presentation material will be added after each module concludes. Module dates are as below:
Module 1: Oct 11-13 (All participants go to UMD)
Module 2: Nov 1-3 (participants attend at their originally indicated location; UMD, NYU, UW Seattle, or UChicago)
Module 3: Nov 15-17 (participants attend at their originally indicated location; UMD, NYU, UW Seattle, or UChicago)
Module 4: Dec 6-8 (participants attend at their originally indicated location; UMD, NYU, UW Seattle, or UChicago)
Presentations: Feb 7th & 8th (participants may attend at their originally indicated location or via WebEx)
Content on this page
Data documentation
data from agencies
Illinois Department of Human Services (IDHS): documentation link
Illinois Department of Employment Security (IDES)
Individual wage records: documentation link
Employer data: documentation link
Illinois Department of Corrections (IDOC) admissions: link to pdf
IDOC exits: link to pdf
More details on EDUCLVL and HCLASS fields:
BIRTHPL lookup file
HUD program data: link to pdf (note descriptions are of unaggregated data and not all fields will be included due to data use agreements)
Parole data dictionary: link to Sheet
additional documentation
From Chapin Hall:
2000 - Outcomes for the Income Maintenance of Caseload DURING Receipt (PDF)
2000 - Outcomes for the Income Maintenance of Caseload AFTER Receipt (PDF)
2004 - Understanding the Food Stamp Program Participation Decisions for TANF Leavers (PDF)
Policy Lab - Increasing Self-Sufficiency Among IL ABAWDs (PDF)
Deduplication and data cleaning report for IDHS data (docx)
More information on programs to help IL welfare recipients find stable jobs (doc)
REFERENCE Diagrams
Note: Join IDOC to IDHS using the column named "ssn_hash" in:
IDOC: "ildoc.person", "ildoc.ildoc_admit", and "ildoc.ildoc_exit"
IDHS: "idhs.hh_member" and "idhs.member"
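The join described in the note above can be sketched as follows. This is a minimal illustration only: it uses an in-memory SQLite database as a stand-in for the real database, attaches two databases named "ildoc" and "idhs" to mimic the schemas, and fills them with made-up rows. Only the schema names, table names, and the "ssn_hash" column come from the note; the "admit_year" and "program" columns and all values are hypothetical.

```python
import sqlite3

# In-memory SQLite stand-in; the attached databases mimic the two schemas.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS ildoc")
conn.execute("ATTACH DATABASE ':memory:' AS idhs")

# Hypothetical minimal tables: only ssn_hash is taken from the note above.
conn.execute("CREATE TABLE ildoc.ildoc_admit (ssn_hash TEXT, admit_year INTEGER)")
conn.execute("CREATE TABLE idhs.member (ssn_hash TEXT, program TEXT)")

# Made-up rows for illustration.
conn.executemany(
    "INSERT INTO ildoc.ildoc_admit VALUES (?, ?)",
    [("a1", 2010), ("b2", 2012), ("c3", 2014)],
)
conn.executemany(
    "INSERT INTO idhs.member VALUES (?, ?)",
    [("a1", "TANF"), ("c3", "SNAP"), ("d4", "TANF")],
)

# Inner join on the shared hashed identifier: keeps only people
# who appear in both the IDOC and the IDHS table.
JOIN_SQL = """
SELECT a.ssn_hash, a.admit_year, m.program
FROM ildoc.ildoc_admit AS a
JOIN idhs.member AS m
  ON a.ssn_hash = m.ssn_hash
ORDER BY a.ssn_hash
"""
matched = conn.execute(JOIN_SQL).fetchall()
print(matched)
```

The same SELECT statement should carry over to the other tables listed in the note by swapping the table names, provided each table exposes "ssn_hash" as described.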
Offenses and SPSS codes: lookup table
MODULE 1 - Agenda
Please note that the below is subject to change. All times Eastern US
Links for some of the training materials will be posted closer to the date of the sessions
October 11, 2017: Welcome & Introduction, Core Datasets & Git
08:00 AM - 08:45 AM Shuttle service to Van Munching Hall (~5 minutes’ travel time)
08:45 AM - 09:00 AM Get settled
09:00 AM - 10:15 AM Welcome and program introduction (Slides)
10:15 AM - 11:00 AM ADRF training & data use agreements (Slides)
11:00 AM - 11:15 AM Break
11:15 AM - 11:45 AM Orientation to the ADRF
11:45 AM - 12:30 PM Command line & git: introduction & basics
12:30 PM - 01:30 PM Lunch
01:30 PM - 02:45 PM Git, continued
02:45 PM - 03:30 PM Defining & scoping data analysis projects
03:30 PM - 03:45 PM Break
03:45 PM - 05:00 PM Introduction to datasets
05:00 PM - 05:15 PM Feedback
October 12, 2017: SQL & Databases
08:00 AM - 08:45 AM Shuttle service to Van Munching Hall (~5 minutes’ travel time)
08:45 AM - 09:00 AM Get settled
09:00 AM - 09:15 AM Overview of day
09:15 AM - 09:45 AM Revisit git and command line
09:45 AM - 10:45 AM Introduction to SQL & databases
10:45 AM - 11:00 AM Break
11:00 AM - 12:30 PM Databases & SQL continued
12:30 PM - 01:30 PM Lunch
01:30 PM - 02:15 PM Guest lecture: John Thompson (slides)
02:15 PM - 03:00 PM PostGIS & spatial SQL
03:00 PM - 03:15 PM Break
03:15 PM - 04:30 PM Explore data
04:30 PM - 04:55 PM Daily recap
04:55 PM - 05:00 PM Feedback
Evening: team project assignment
October 13, 2017: Python for Data Analysis
08:00 AM - 08:45 AM Shuttle service to Van Munching Hall (~5 minutes’ travel time)
08:45 AM - 09:00 AM Get settled
09:00 AM - 09:15 AM Overview of day
09:15 AM - 10:45 AM Intro to Python for data analysis
10:45 AM - 11:00 AM Break
11:00 AM - 11:30 AM Metrics: dealing with earnings data
11:30 AM - 12:30 PM Python data analysis continued
12:30 PM - 01:30 PM Lunch
01:30 PM - 02:15 PM Problem solving data analysis
02:15 PM - 02:30 PM Project scoping & goal setting
02:30 PM - 03:45 PM Teamwork: project discussions
03:45 PM - 04:00 PM Break
04:00 PM - 04:45 PM Closing discussion & feedback
MODULE 2 - Agenda
Please note that the below is subject to change.
Links for some of the training materials will be posted closer to the date of the sessions
Note: Because participants are spread across three time zones, locations have different start and end times to accommodate project work. Times for each location are listed below in local time:
Eastern zone (UMD, NYU, and CT), Eastern time
09:00 AM - 11:00 AM (Project Work)
11:00 AM - 05:00 PM (Sessions)
Central zone (Chicago), Central time
09:00 AM - 10:00 AM (Project Work)
10:00 AM - 04:00 PM (Sessions)
04:00 PM - 05:00 PM (Project Work)
Pacific zone (UW, Seattle), Pacific time
08:00 AM - 02:00 PM (Sessions)
02:00 PM - 04:00 PM (Project Work)
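The local session times above follow from converting the shared 11:00 AM - 05:00 PM Eastern session block into each zone. A small sketch of that arithmetic, assuming fixed UTC offsets for the Nov 1-3, 2017 dates (US daylight time was still in effect until Nov 5, 2017); the names `EASTERN`, `ZONES`, and `local_blocks` are illustrative only:

```python
from datetime import datetime, timedelta, timezone

# Fixed UTC offsets assumed for Nov 1-3, 2017 (daylight time still in effect).
EASTERN = timezone(timedelta(hours=-4), "EDT")
ZONES = {
    "Central": timezone(timedelta(hours=-5), "CDT"),
    "Pacific": timezone(timedelta(hours=-7), "PDT"),
}

# Shared session block, expressed in Eastern time.
start = datetime(2017, 11, 1, 11, 0, tzinfo=EASTERN)
end = datetime(2017, 11, 1, 17, 0, tzinfo=EASTERN)

# Convert the block into each location's local clock time.
local_blocks = {
    name: (
        start.astimezone(tz).strftime("%I:%M %p"),
        end.astimezone(tz).strftime("%I:%M %p"),
    )
    for name, tz in ZONES.items()
}

for name, (local_start, local_end) in local_blocks.items():
    print(f"{name}: {local_start} - {local_end} (Sessions)")
```

The offsets between the three zones stay the same after daylight time ends, so the local session times are unchanged for later dates even though the UTC offsets themselves shift by one hour.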
Detailed Agenda - all times Eastern
Nov 1, 2017: Record Linkage (For local start/end times, check the beginning of this section)
WebEx recording (full day) and Dealing with earnings data recording (Julia's presentation)
09:00 AM - 09:30 AM (Eastern only) Dealing with earnings data (slides)
09:30 AM - 11:00 AM Project Time for Eastern/Central Cohort
11:00 AM - 11:15 AM Program recap/review (slides)
11:15 AM - 12:15 PM SQL & databases
12:15 PM - 12:30 PM Break
12:30 PM - 01:30 PM Introduction to Record Linkage (link to Slides)
01:30 PM - 02:30 PM Lunch
02:30 PM - 03:15 PM Guest lecture: Greg Dobler (PDF)
03:15 PM - 04:00 PM Record Linkage examples & exercises
04:00 PM - 04:15 PM Break
04:15 PM - 05:00 PM Record Linkage exercises (continued)
05:00 PM - 05:30 PM (Central/Pacific only) Dealing with earnings data (slides)
05:30 PM - 07:00 PM Project work time for Central/Pacific locations
Nov 2, 2017: Visualization & APIs (For local start/end times, check the beginning of this section)
09:00 AM - 11:00 AM Project Time for Eastern/Central Cohort
11:00 AM - 12:15 PM Introduction to Data Visualization (Link to Slides)
12:15 PM - 12:30 PM Break
12:30 PM - 01:30 PM Continuing Data Visualization (Link to Slides)
01:30 PM - 02:30 PM Lunch
02:30 PM - 03:30 PM Introduction to APIs (Link to slides)
03:30 PM - 03:45 PM Break
03:45 PM - 05:00 PM (team choice) API exercises or project work
05:00 PM - 07:00 PM Project work time for Central/Pacific locations
Nov 3, 2017: Network Analysis + Projects (For local start/end times, check the beginning of this section)
09:00 AM - 11:00 AM Project Time for Eastern/Central Cohort
11:00 AM - 12:00 PM Network Analysis Lecture + Interactive Introduction
12:00 PM - 12:15 PM Break
12:15 PM - 01:30 PM Interactive exercises
01:30 PM - 02:30 PM Lunch
02:30 PM - 05:00 PM Project work
05:00 PM - 07:00 PM Project work time for Central/Pacific locations
MODULE 3 - Agenda
Please note that the below is subject to change.
Links for some of the training materials will be posted closer to the date of the sessions
Note: Because participants are spread across three time zones, locations have different start and end times to accommodate project work. Times for each location are listed below in local time:
Eastern zone (UMD, NYU, and CT), Eastern time
09:00 AM - 11:00 AM (Project Work)
11:00 AM - 05:00 PM (Sessions)
Central zone (Chicago), Central time
09:00 AM - 10:00 AM (Project Work)
10:00 AM - 04:00 PM (Sessions)
04:00 PM - 05:00 PM (Project Work)
Pacific zone (UW, Seattle), Pacific time
08:00 AM - 02:00 PM (Sessions)
02:00 PM - 04:00 PM (Project Work)
Detailed Agenda - all times Eastern
Notebooks: download
Nov 15, 2017: Intro to Machine Learning (For local start/end times, check the beginning of this section)
09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations
11:00 AM - 11:15 AM Program recap/review
11:15 AM - 12:30 PM Introduction to Machine Learning (Link)
12:30 PM - 01:30 PM
Lunch (Eastern/Central)
Project Work (Pacific)
01:30 PM - 03:00 PM Machine Learning (continued) (Link)
03:00 PM - 04:00 PM
(Eastern/Central) Project Work
(Pacific) Lunch
04:00 PM - 05:00 PM Machine Learning Methods (Features/Labels)
05:00 PM - 07:00 PM (Pacific) Project work
Nov 16, 2017: Machine Learning (For local start/end times, check the beginning of this section)
WebEx recording, 1st notebook session
09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations
Suggested project outline (slide)
11:00 AM - 12:30 PM Machine Learning Methods and/or Notebooks for Modeling
12:30 PM - 01:30 PM
(Eastern/Central) Lunch
(Pacific) Project Work
WebEx recording, Guest speaker through end of day
01:30 PM - 02:15 PM Guest Speaker: Nikhil Naik (slides)
02:15 PM - 03:00 PM Machine Learning Recap
03:00 PM - 04:00 PM
(Eastern/Central) Project Work
(Pacific) Lunch
04:00 PM - 05:00 PM Project Work with Machine Learning
05:00 PM - 07:00 PM (Pacific) Project work
(optional) happy hour
Nov 17, 2017: Text Analysis (For local start/end times, check the beginning of this section)
09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations
11:00 AM - 12:30 PM Introduction to Text Analysis (slides)
12:30 PM - 01:30 PM
(Eastern/Central) Lunch
(Pacific) Project Work
01:30 PM - 03:00 PM Text Analytics (Jupyter Notebook)
03:00 PM - 04:00 PM
(Eastern/Central) Project Work
(Pacific) Lunch
04:00 PM - 05:00 PM
(Eastern & Central) Interim Project Presentations & feedback
(Pacific) Project work
05:00 PM - 07:00 PM (Pacific) Project work & Interim Project Presentations & Feedback
MODULE 4 - Agenda
Please note that the below is subject to change.
Links for some of the training materials will be posted closer to the date of the sessions
Note: Because participants are spread across three time zones, locations have different start and end times to accommodate project work. Times for each location are listed below in local time:
Eastern zone (UMD, NYU, and CT), Eastern time
09:00 AM - 11:00 AM (Project Work)
11:00 AM - 05:00 PM (Sessions)
Central zone (Chicago), Central time
09:00 AM - 10:00 AM (Project Work)
10:00 AM - 04:00 PM (Sessions)
04:00 PM - 05:00 PM (Project Work)
Pacific zone (UW, Seattle), Pacific time
08:00 AM - 02:00 PM (Sessions)
02:00 PM - 04:00 PM (Project Work)
Detailed Agenda - all times Eastern
Dec 6, 2017: Web Scraping (For local start/end times, check the beginning of this section)
WebEx recordings: morning, afternoon
09:00 AM - 10:00 AM Project work time for Eastern and Central time zone locations
10:00 AM - 11:00 AM Interim Presentations (Eastern & Central nodes)
11:00 AM - 11:15 AM Program Review
11:15 AM - 12:30 PM Web Scraping (Intro)
12:30 PM - 01:30 PM
(Eastern/Central) Lunch
(Pacific) Project Work
01:30 PM - 02:15 PM Guest Speaker: John C Havens (slides, pdf)
02:15 PM - 03:00 PM Machine Learning Recap
03:00 PM - 04:00 PM
(Eastern/Central) Project Work
(Pacific) Lunch
04:00 PM - 05:00 PM Project Work
05:00 PM - 07:00 PM (Pacific) Project work + Interim Presentations
Dec 7, 2017: Inference & Big Data (For local start/end times, check the beginning of this section)
WebEx recording: morning, afternoon
09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations
11:00 AM - 12:30 PM
12:30 PM - 01:30 PM
(Eastern/Central) Lunch
(Pacific) Project Work
01:30 PM - 03:00 PM Big Data (Tim Savage - slides)
03:00 PM - 04:00 PM
(Eastern/Central) Project Work
(Pacific) Lunch
04:00 PM - 05:00 PM Inference & Exercises
05:00 PM - 07:00 PM (Pacific) Project work
(optional) happy hour
Dec 8, 2017: Privacy & Confidentiality (For local start/end times, check the beginning of this section)
WebEx recordings: morning, afternoon
09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations
11:00 AM - 12:15 PM Privacy & Confidentiality (slides)
12:15 PM - 12:45 PM Machine Learning Q&A (over Pizza Lunch)
12:45 PM - 01:30 PM Break
01:30 PM - 03:00 PM Privacy & Confidentiality Exercises
03:00 PM - 04:00 PM
(Eastern/Central) Project Work
(Pacific) Lunch
04:00 PM - 05:00 PM Walk Through of Disclosure Review
05:00 PM - 05:15 PM (Eastern/Central) Program Closing & Suggested Work Timeline
05:00 PM - 07:00 PM (Pacific) Project work + Program Closing
PRESENTATIONS - Agenda
Sixteen teams will present projects on February 7th & 8th between 12:30 PM and 4:30 PM Eastern. Each team will be given 20 minutes to present, followed by up to 10 minutes of Q&A led by the program directors, Rayid Ghani, Frauke Kreuter, and Julia Lane. Please see the WebEx information below for how to view presentations remotely.
Presentation schedule (all times are Eastern US)
February 8 (recording)
12:30 - UMD 2: Predicting Future Employment Gap (ppt)
1:00 - CT 1: Welfare dependency & transition from TANF (ppt)
1:30 - UW 2: Mental Illness and Drug Dependency (MIDD) and access to public transit (pdf)
2:00 - NYU 2: Vulnerable Populations' Access to Points of Distribution for Public Health Emergencies (pdf)
2:30 Break
2:40 - UMD 3: Characteristics of TANF recipients in Illinois
3:10 - NYU 1: How can we better predict which TANF recipients will be successful? (pdf)
3:40 - UMD 5: First time prisoners: predicting recidivism
4:10 - NYU 3: Are Persons Released from Illinois Department of Corrections Receiving Needed Social Service Benefits? (ppt)
February 7 (recording)
12:30 - UC 3: Predicting Recidivism due to Technical Violation (ppt)
1:00 - UMD 4: Modeling the School to Prison pipeline in Chicago, IL (ppt)
1:30 - NYU 4: Success, the Flip-side to the Recidivism Conversation. (ppt)
2:00 - UC 1: Benefits After Release: Does it make a difference?
2:30 - UMD 6: The TANF Ban and Criminal Activity: Evidence from IL Admin Data (pdf)
3:00 - CT 2: A look into Peoria and Cook County Illinois - Measuring access to wages by location (pdf)
3:30 - UMD 1: Predicting Future Earnings of Illinois Human Service Benefit Recipients (ppt)
4:00 - UW 1: Reducing Return to Welfare (ppt)
Peer review questions.
If possible, please respond via this Form. If you cannot access the form, please send answers to the questions below to dataanalytics@umd.edu:
Team number (e.g., NYU 4, UMD 2, UC 3)
Would the approach used in this project be useful for your agency/organization? (Yes, No, Maybe, N/A)
What did you like about the project? (short answer)
What aspects of the project do you think could be improved? (short answer)
WebEx information
Attendee links
February 7 attendee link
February 8 attendee link
WebEx testing
Test instructions from WebEx to ensure software works
Connecting to WebEx as an Attendee: PDF
Connecting to WebEx as a Panelist: PDF