Program Content - Class 3

Agendas will be posted on this page at least one week before the start of each module, and any presentation material will be added after each module concludes. Module dates are listed below:

  1. Module 1: Oct 11-13 (All participants go to UMD)

  2. Module 2: Nov 1-3 (participants attend at their originally indicated location; UMD, NYU, UW Seattle, or UChicago)

  3. Module 3: Nov 15-17 (participants attend at their originally indicated location; UMD, NYU, UW Seattle, or UChicago)

  4. Module 4: Dec 6-8 (participants attend at their originally indicated location; UMD, NYU, UW Seattle, or UChicago)

  5. Presentations: Feb 7th & 8th (participants may attend at their originally indicated location or via WebEx)


Data documentation

data from agencies

additional documentation

  • IDHS database diagram: html, pdf

  • From Chapin Hall:

    • 2000 - Outcomes for the Income Maintenance of Caseload DURING Receipt (PDF)

    • 2000 - Outcomes for the Income Maintenance of Caseload AFTER Receipt (PDF)

    • 2004 - Understanding the Food Stamp Program Participation Decisions for TANF Leavers (PDF)

    • Policy Lab - Increasing Self-Sufficiency Among IL ABAWDs (PDF)

    • Deduplication and data cleaning report for IDHS data (docx)

  • More information on programs to help IL welfare recipients find stable jobs (doc)

Reference diagrams

  • IDHS - IDES reference diagram: html, pdf

  • IDOC - IDES reference diagram: html, pdf

  • Note: Join IDOC to IDHS using the column named "ssn_hash" in:

    • IDOC: "ildoc.person", "ildoc.ildoc_admit", and "ildoc.ildoc_exit"

    • IDHS: "idhs.hh_member" and "idhs.member"

  • Offenses and SPSS codes: lookup table
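
The join described in the note above can be sketched in Python with the standard-library sqlite3 module. This is only a minimal illustration under stated assumptions, not the ADRF setup: the in-memory SQLite schemas merely mimic the names "ildoc" and "idhs", the label columns and every row are invented, and only the table names and the "ssn_hash" column come from the note.

```python
import sqlite3

# Toy stand-in for the real database: two in-memory SQLite schemas named
# after the ones in the note. Only the schema/table names and the ssn_hash
# column come from the note; the other columns and all rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("ATTACH DATABASE ':memory:' AS ildoc")
conn.execute("ATTACH DATABASE ':memory:' AS idhs")

conn.execute("CREATE TABLE ildoc.person (ssn_hash TEXT, doc_label TEXT)")
conn.execute("CREATE TABLE idhs.member (ssn_hash TEXT, hs_label TEXT)")

conn.executemany(
    "INSERT INTO ildoc.person VALUES (?, ?)",
    [("a1", "doc-1"), ("b2", "doc-2")],
)
conn.executemany(
    "INSERT INTO idhs.member VALUES (?, ?)",
    [("b2", "hs-1"), ("c3", "hs-2")],
)

# Inner join on the shared hashed identifier: only people present in both
# tables are returned.
JOIN_SQL = """
SELECT p.ssn_hash, p.doc_label, m.hs_label
FROM ildoc.person AS p
JOIN idhs.member AS m ON p.ssn_hash = m.ssn_hash
ORDER BY p.ssn_hash
"""
matched = conn.execute(JOIN_SQL).fetchall()
print(matched)  # [('b2', 'doc-2', 'hs-1')]
```

The same ON clause would apply to the other tables listed in the note; against the real database the query would run on the actual schemas rather than these toy tables.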


MODULE 1 - Agenda

Please note that the agenda below is subject to change. All times are Eastern US.
Links to some of the training materials will be posted closer to the date of the sessions.

Notebook zipfile

October 11, 2017: Welcome & Introduction, Core Datasets & Git

  • 08:00 AM - 08:45 AM Shuttle service to Van Munching Hall (~5 minutes’ travel time)

  • 08:45 AM - 09:00 AM Get settled

  • 09:00 AM - 10:15 AM Welcome and program introduction (Slides)

  • 10:15 AM - 11:00 AM ADRF training & data use agreements (Slides)

  • 11:00 AM - 11:15 AM Break

  • 11:15 AM - 11:45 AM Orientation to the ADRF

  • 11:45 AM - 12:30 PM Command line & git: introduction & basics

  • 12:30 PM - 01:30 PM Lunch

  • 01:30 PM - 02:45 PM Git, continued

  • 02:45 PM - 03:30 PM Defining & scoping data analysis projects

  • 03:30 PM - 03:45 PM Break

  • 03:45 PM - 05:00 PM Introduction to datasets

  • 05:00 PM - 05:15 PM Feedback

October 12, 2017: SQL & Databases

  • 08:00 AM - 08:45 AM Shuttle service to Van Munching Hall (~5 minutes’ travel time)

  • 08:45 AM - 09:00 AM Get settled

  • 09:00 AM - 09:15 AM Overview of day

  • 09:15 AM - 09:45 AM Revisit git and command line

  • 09:45 AM - 10:45 AM Introduction to SQL & databases

  • 10:45 AM - 11:00 AM Break

  • 11:00 AM - 12:30 PM Databases & SQL continued

  • 12:30 PM - 01:30 PM Lunch

  • 01:30 PM - 02:15 PM Guest lecture: John Thompson (slides)

  • 02:15 PM - 03:00 PM PostGIS & spatial SQL

  • 03:00 PM - 03:15 PM Break

  • 03:15 PM - 04:30 PM Explore data

  • 04:30 PM - 04:55 PM Daily recap

  • 04:55 PM - 05:00 PM Feedback

  • Evening: team project assignment

October 13, 2017: Python for Data Analysis

  • 08:00 AM - 08:45 AM Shuttle service to Van Munching Hall (~5 minutes’ travel time)

  • 08:45 AM - 09:00 AM Get settled

  • 09:00 AM - 09:15 AM Overview of day

  • 09:15 AM - 10:45 AM Intro to Python for data analysis

  • 10:45 AM - 11:00 AM Break

  • 11:00 AM - 11:30 AM Metrics: dealing with earnings data

  • 11:30 AM - 12:30 PM Python data analysis continued

  • 12:30 PM - 01:30 PM Lunch

  • 01:30 PM - 02:15 PM Problem solving data analysis

  • 02:15 PM - 02:30 PM Project scoping & goal setting

  • 02:30 PM - 03:45 PM Teamwork: project discussions

  • 03:45 PM - 04:00 PM Break

  • 04:00 PM - 04:45 PM Closing discussion & feedback


MODULE 2 - Agenda

Please note that the agenda below is subject to change.
Links to some of the training materials will be posted closer to the date of the sessions.

Note: Because participants span three time zones, locations have different start and end times to accommodate project work. The times for each location are given below, in local time:

Eastern locations (UMD, NYU, and CT), Eastern time
09:00 AM - 11:00 AM (Project Work)
11:00 AM - 05:00 PM (Sessions)
Central location (Chicago), Central time
09:00 AM - 10:00 AM (Project Work)
10:00 AM - 04:00 PM (Sessions)
04:00 PM - 05:00 PM (Project Work)
Pacific location (UW Seattle), Pacific time
08:00 AM - 02:00 PM (Sessions)
02:00 PM - 04:00 PM (Project Work)

Notebook zipfile

Detailed Agenda - all times Eastern

Nov 1, 2017: Record Linkage (For local start/end times, check the beginning of this section)

WebEx recording (full day) and Dealing with earnings data recording (Julia's presentation)

  • 09:00 AM - 09:30 AM (Eastern only) Dealing with earnings data (slides)

  • 09:30 AM - 11:00 AM Project Time for Eastern/Central Cohort

  • 11:00 AM - 11:15 AM Program recap/review (slides)

  • 11:15 AM - 12:15 PM SQL & databases

  • 12:15 PM - 12:30 PM Break

  • 12:30 PM - 01:30 PM Introduction to Record Linkage (link to Slides)

  • 01:30 PM - 02:30 PM Lunch

  • 02:30 PM - 03:15 PM Guest lecture: Greg Dobler (PDF)

  • 03:15 PM - 04:00 PM Record Linkage examples & exercises

  • 04:00 PM - 04:15 PM Break

  • 04:15 PM - 05:00 PM Record Linkage exercises (continued)

  • 05:00 PM - 05:30 PM (Central/Pacific only) Dealing with earnings data (slides)

  • 05:30 PM - 07:00 PM Project work time for Central/Pacific locations

Nov 2, 2017: Visualization & APIs (For local start/end times, check the beginning of this section)

WebEx recording

  • 09:00 AM - 11:00 AM Project Time for Eastern/Central Cohort

  • 11:00 AM - 12:15 PM Introduction to Data Visualization (Link to Slides)

  • 12:15 PM - 12:30 PM Break

  • 12:30 PM - 01:30 PM Continuing Data Visualization (Link to Slides)

  • 01:30 PM - 02:30 PM Lunch

  • 02:30 PM - 03:30 PM Introduction to APIs (Link to Slides)

  • 03:30 PM - 03:45 PM Break

  • 03:45 PM - 05:00 PM (team choice) API exercises or project work

  • 05:00 PM - 07:00 PM Project work time for Central/Pacific locations

Nov 3, 2017: Network Analysis + Projects (For local start/end times, check the beginning of this section)

WebEx recording

  • 09:00 AM - 11:00 AM Project Time for Eastern/Central Cohort

  • 11:00 AM - 12:00 PM Network Analysis Lecture + Interactive Introduction

  • 12:00 PM - 12:15 PM Break

  • 12:15 PM - 01:30 PM Interactive exercises

  • 01:30 PM - 02:30 PM Lunch

  • 02:30 PM - 05:00 PM Project work

  • 05:00 PM - 07:00 PM Project work time for Central/Pacific locations

MODULE 3 - Agenda

Please note that the agenda below is subject to change.
Links to some of the training materials will be posted closer to the date of the sessions.

Note: Because participants span three time zones, locations have different start and end times to accommodate project work. The times for each location are given below, in local time:

Eastern locations (UMD, NYU, and CT), Eastern time
09:00 AM - 11:00 AM (Project Work)
11:00 AM - 05:00 PM (Sessions)
Central location (Chicago), Central time
09:00 AM - 10:00 AM (Project Work)
10:00 AM - 04:00 PM (Sessions)
04:00 PM - 05:00 PM (Project Work)
Pacific location (UW Seattle), Pacific time
08:00 AM - 02:00 PM (Sessions)
02:00 PM - 04:00 PM (Project Work)

Detailed Agenda - all times Eastern

Notebooks: download

Nov 15, 2017: Intro to Machine Learning (For local start/end times, check the beginning of this section)

WebEx recording

  • 09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations

  • 11:00 AM - 11:15 AM Program recap/review

  • 11:15 AM - 12:30 PM Introduction to Machine Learning (Link)

  • 12:30 PM - 01:30 PM

    • Lunch (Eastern/Central)

    • Project Work (Pacific)

  • 01:30 PM - 03:00 PM Machine Learning (continued) (Link)

  • 03:00 PM - 04:00 PM

    • (Eastern/Central) Project Work

    • (Pacific) Lunch

  • 04:00 PM - 05:00 PM Machine Learning Methods (Features/Labels)

  • 05:00 PM - 07:00 PM (Pacific) Project work

Nov 16, 2017: Machine Learning (For local start/end times, check the beginning of this section)

WebEx recording, 1st notebook session

  • 09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations

    • Suggested project outline (slide)

  • 11:00 AM - 12:30 PM Machine Learning Methods and/or Notebooks for Modeling

  • 12:30 PM - 01:30 PM

    • (Eastern/Central) Lunch

    • (Pacific) Project Work

WebEx recording, Guest speaker through end of day

  • 01:30 PM - 02:15 PM Guest Speaker: Nikhil Naik (slides)

  • 02:15 PM - 03:00 PM Machine Learning Recap

  • 03:00 PM - 04:00 PM

    • (Eastern/Central) Project Work

    • (Pacific) Lunch

  • 04:00 PM - 05:00 PM Project Work with Machine Learning

  • 05:00 PM - 07:00 PM (Pacific) Project work

  • (optional) happy hour

Nov 17, 2017: Text Analysis (For local start/end times, check the beginning of this section)

WebEx recording

  • 09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations

  • 11:00 AM - 12:30 PM Introduction to Text Analysis (slides)

  • 12:30 PM - 01:30 PM

    • (Eastern/Central) Lunch

    • (Pacific) Project Work

  • 01:30 PM - 03:00 PM Text Analytics (Jupyter Notebook)

  • 03:00 PM - 04:00 PM

    • (Eastern/Central) Project Work

    • (Pacific) Lunch

  • 04:00 PM - 05:00 PM

    • (Eastern & Central) Interim Project Presentations & feedback

    • (Pacific) Project work

  • 05:00 PM - 07:00 PM (Pacific) Project work & Interim Project Presentations & Feedback

MODULE 4 - Agenda

Please note that the agenda below is subject to change.
Links to some of the training materials will be posted closer to the date of the sessions.

Note: Because participants span three time zones, locations have different start and end times to accommodate project work. The times for each location are given below, in local time:

Eastern locations (UMD, NYU, and CT), Eastern time
09:00 AM - 11:00 AM (Project Work)
11:00 AM - 05:00 PM (Sessions)
Central location (Chicago), Central time
09:00 AM - 10:00 AM (Project Work)
10:00 AM - 04:00 PM (Sessions)
04:00 PM - 05:00 PM (Project Work)
Pacific location (UW Seattle), Pacific time
08:00 AM - 02:00 PM (Sessions)
02:00 PM - 04:00 PM (Project Work)

Detailed Agenda - all times Eastern

Dec 6, 2017: Web Scraping (For local start/end times, check the beginning of this section)

WebEx recordings: morning, afternoon

  • 09:00 AM - 10:00 AM Project work time for Eastern and Central time zone locations

  • 10:00 AM - 11:00 AM Interim Presentations (Eastern & Central nodes)

  • 11:00 AM - 11:15 AM Program Review

  • 11:15 AM - 12:30 PM Web Scraping (Intro)

  • 12:30 PM - 01:30 PM

    • (Eastern/Central) Lunch

    • (Pacific) Project Work

  • 01:30 PM - 02:15 PM Guest Speaker: John C Havens (slides, pdf)

  • 02:15 PM - 03:00 PM Machine Learning Recap

  • 03:00 PM - 04:00 PM

    • (Eastern/Central) Project Work

    • (Pacific) Lunch

  • 04:00 PM - 05:00 PM Project Work

  • 05:00 PM - 07:00 PM (Pacific) Project work + Interim Presentations

Dec 7, 2017: Inference & Big Data (For local start/end times, check the beginning of this section)

WebEx recordings: morning, afternoon

  • 09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations

  • 11:00 AM - 12:30 PM

  • 12:30 PM - 01:30 PM

    • (Eastern/Central) Lunch

    • (Pacific) Project Work

  • 01:30 PM - 03:00 PM Big Data (Tim Savage - slides)

  • 03:00 PM - 04:00 PM

    • (Eastern/Central) Project Work

    • (Pacific) Lunch

  • 04:00 PM - 05:00 PM Inference & Exercises

  • 05:00 PM - 07:00 PM (Pacific) Project work

  • (optional) happy hour

Dec 8, 2017: Privacy & Confidentiality (For local start/end times, check the beginning of this section)

WebEx recordings: morning, afternoon

  • 09:00 AM - 11:00 AM Project work time for Eastern and Central time zone locations

  • 11:00 AM - 12:15 PM Privacy & Confidentiality (slides)

  • 12:15 PM - 12:45 PM Machine Learning Q&A (over Pizza Lunch)

  • 12:45 PM - 01:30 PM Break

  • 01:30 PM - 03:00 PM Privacy & Confidentiality Exercises

  • 03:00 PM - 04:00 PM

    • (Eastern/Central) Project Work

    • (Pacific) Lunch

  • 04:00 PM - 05:00 PM Walk Through of Disclosure Review

  • 05:00 PM - 05:15 PM (Eastern/Central) Program Closing & Suggested Work Timeline

  • 05:00 PM - 07:00 PM (Pacific) Project work + Program Closing

PRESENTATIONS - Agenda
Sixteen teams will present projects on February 7th & 8th between 12:30 and 4:30 PM Eastern. Each team will be given 20 minutes to present, followed by up to 10 minutes of Q&A led by the program directors, Rayid Ghani, Frauke Kreuter, and Julia Lane. Please see the WebEx information below for how to view presentations remotely.

Presentation schedule (all times are Eastern US)

February 8 (recording)

  • 12:30 - UMD 2: Predicting Future Employment Gap (ppt)

  • 1:00 - CT 1: Welfare dependency & transition from TANF (ppt)

  • 1:30 - UW 2: Mental Illness and Drug Dependency (MIDD) and access to public transit (pdf)

  • 2:00 - NYU 2: Vulnerable Populations' Access to Points of Distribution for Public Health Emergencies (pdf)

  • 2:30 Break

  • 2:40 - UMD 3: Characteristics of TANF recipients in Illinois

  • 3:10 - NYU 1: How can we better predict which TANF recipients will be successful? (pdf)

  • 3:40 - UMD 5: First time prisoners: predicting recidivism

  • 4:10 - NYU 3: Are Persons Released from Illinois Department of Corrections Receiving Needed Social Service Benefits? (ppt)

February 7 (recording)

  • 12:30 - UC 3: Predicting Recidivism due to Technical Violation (ppt)

  • 1:00 - UMD 4: Modeling the School to Prison pipeline in Chicago, IL (ppt)

  • 1:30 - NYU 4: Success, the Flip-side to the Recidivism Conversation. (ppt)

  • 2:00 - UC 1: Benefits After Release: Does it make a difference?

  • 2:30 - UMD 6: The TANF Ban and Criminal Activity: Evidence from IL Admin Data (pdf)

  • 3:00 - CT 2: A look into Peoria and Cook County Illinois - Measuring access to wages by location (pdf)

  • 3:30 - UMD 1: Predicting Future Earnings of Illinois Human Service Benefit Recipients (ppt)

  • 4:00 - UW 1: Reducing Return to Welfare (ppt)

 


Peer review questions

If possible, please respond via this Form; if you cannot access the form, please send answers to the questions below to dataanalytics@umd.edu:

  • Team number (e.g., NYU 4, UMD 2, UC 3)

  • Would the approach used in this project be useful for your agency/organization? (Yes, No, Maybe, N/A)

  • What did you like about the project? (short answer)

  • What aspects of the project do you think could be improved? (short answer)

WebEx information

Attendee links

WebEx testing