Final Project for CAPP 30254 Machine Learning for Public Policy

chankrista/ml_final_project

The Role of Networks in “Bad Actor” Identification: Informing Investigative Journalism

CAPP 30254 Machine Learning for Public Policy

Final Project

Aequitas.ipynb: runs the Aequitas toolkit to audit the models for fairness and bias

crime_portal.py: adds features generated from the Chicago Open Data Portal

data: folder containing the data used for the project

descriptive_stats.ipynb: descriptive statistics of the data

feature_generation.py: contains the code that generates the model's features

feature_list.xlsx: description of all the features

full_pipeline.py: defines the TrainTest and RawDfs classes

ml_loop.py: code used to run the models with different parameters and evaluation metrics

read_data.py: code to read the datasets used in the project

README.md: this file

report.pdf: report containing the description, implementation and findings of the analysis

requirements.txt: libraries and versions required for running the code

run_pipeline.ipynb: runs the machine learning models (takes about five hours to run)

train_test.py: code that performs the temporal splits on the datasets
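
For readers unfamiliar with temporal splits, the idea behind train_test.py can be sketched as follows. This is a minimal illustration, not the project's actual code: the function names (`add_months`, `temporal_splits`, `split_rows`), the `date` field, the expanding training window, and the example dates are all assumptions made for this sketch. The point is that each model is trained only on records dated before the start of its test window, so no future information leaks into training.

```python
from datetime import date


def add_months(d, months):
    """Return the first day of the month `months` after `d`.

    Assumes `d` is itself the first day of a month.
    """
    total = d.year * 12 + (d.month - 1) + months
    return date(total // 12, total % 12 + 1, 1)


def temporal_splits(start, end, test_months):
    """Build (train_start, train_end, test_start, test_end) tuples.

    Intervals are half-open: training covers [train_start, train_end)
    and testing covers [test_start, test_end). The training window
    expands with each split; the test window slides forward.
    """
    splits = []
    test_start = add_months(start, test_months)
    while add_months(test_start, test_months) <= end:
        test_end = add_months(test_start, test_months)
        splits.append((start, test_start, test_start, test_end))
        test_start = test_end
    return splits


def split_rows(rows, split, date_key="date"):
    """Partition a list of dict records into train and test sets by date."""
    train_start, train_end, test_start, test_end = split
    train = [r for r in rows if train_start <= r[date_key] < train_end]
    test = [r for r in rows if test_start <= r[date_key] < test_end]
    return train, test


# Hypothetical date range and records, for illustration only.
splits = temporal_splits(date(2015, 1, 1), date(2017, 1, 1), test_months=6)
rows = [
    {"id": 1, "date": date(2015, 3, 10)},
    {"id": 2, "date": date(2015, 9, 2)},
    {"id": 3, "date": date(2016, 4, 18)},
    {"id": 4, "date": date(2016, 11, 30)},
]
for split in splits:
    train, test = split_rows(rows, split)
    print(split[2], "train ids:", [r["id"] for r in train],
          "test ids:", [r["id"] for r in test])
```

Half-open intervals keep a record that falls exactly on a boundary date from appearing in both the training and the test set.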