Introduction to Programming for Public Policy (30550)

Final Projects (25%)

Working in pairs, students will ask a simple policy question. To answer it they will identify at least two disjoint data sources, merge them, perform a simple but correct statistical analysis and create a simple (but possibly dynamic) dashboard to illustrate this.

Due Dates

By October 31, students should form groups and propose the analysis and project. The proposal should identify a question and data sources that help them to address it (with links). A number of data sources are listed below, for inspiration. It should specify a ‘baseline’ functionality, describe several extensions, and include a link to the code repository.

By midnight December 1st, all code must be definitively checked in. A README should detail the sources, describe problems encountered and solutions, and provide clear instructions for launching the analysis and accessing the output.

During the finals period (December 4, 1:30-3:30), we will have 2-hour palooza, in which groups will describe their work and demonstrate any dynamic functionality, for others in the class and for the professor and TAs.

Baseline

You are aiming for something like our week 9 example on weather and crime. You can find a straightforward writeup of this work, here:

https://harris-ippp.github.io/weather

Suggested extensions

Grading

The grading rubric will be modified to benefit ambitious projects:

Datasets for Inspiration

Please go hunting for dataset that interest you. In the past, students have come up with really great datasets from their home countries. My suggestions below are super US-centric ones that I’ve used recently. But I particularly enjoy seeing foreign datasets.