Skip to content

dorisyan1122/Crime-Predictive-Modeling

Repository files navigation

PPOL-6819 Final Project: Predictive Modeling of County-level Crime Rate

This is Ziwen Lu, Xuyan Xiu, and Doris Yan's machine learning project to predict 2016 crime rate using 2010 as the modeling year, and we compare the actual 2016 crime data with our predictions. Please click on GitHub Pages for details.

Our project contains four parts:

  1. full write-up, including introduction, model and methods, results and limitations of our project;
  2. geospatial mapping of average crime rate from 2002 to 2014;
  3. predictive modeling of 2010 crime rate on 2016 crime rate;
  4. slides for presentation.

Our models are robust to predict the unseen data. Applying our model of violent crime rate on the implementation data in 2016 generates RMSE of 0.593, which means the average difference between our model's predicted values and the actual values is 0.593. The unit for RMSE is log crime rate per 100,000 people. Similarly, for property crime rate, the average difference between our model's predicted values and the actual values is 0.459.