- Downloads wide Confirmed, Deaths and Recovered CSVs from https://github.com/CSSEGISandData/COVID-19.
- Merges the datasets, reshapes them from wide to narrow (long) format, and outputs a single CSV.
- Adds new daily values alongside the running totals.
- Renames some countries.
- Joins to country data from Bing (generated by Microsoft Excel intelligence).
- Drops countries for which physicians-per-capita statistics could not be found.
- Adds columns:
  - Growth Rate
  - Infected Percentage
  - Infected per Million
  - Net Cases
  - Healthcare System Saturation
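
The reshape and daily-value steps listed above can be sketched with pandas. This is a minimal illustration, not the code in `main.py`: the two tiny input tables, the `to_narrow` helper, and the `New Confirmed` / `New Deaths` column names are all assumptions made for the example, and the real CSSE files carry extra columns (province, latitude, longitude) that are left out here.

```python
import pandas as pd

# Hypothetical miniature wide tables: one row per country,
# one column per date holding the running total.
wide_confirmed = pd.DataFrame({
    "Country/Region": ["A", "B"],
    "1/22/20": [1, 0],
    "1/23/20": [3, 2],
    "1/24/20": [6, 2],
})
wide_deaths = pd.DataFrame({
    "Country/Region": ["A", "B"],
    "1/22/20": [0, 0],
    "1/23/20": [1, 0],
    "1/24/20": [1, 1],
})


def to_narrow(wide, value_name):
    """Turn the date columns into rows: one row per (country, date)."""
    narrow = wide.melt(
        id_vars="Country/Region", var_name="Date", value_name=value_name
    )
    narrow["Date"] = pd.to_datetime(narrow["Date"], format="%m/%d/%y")
    return narrow


# Merge the narrow tables on country and date so each metric is a column.
narrow = to_narrow(wide_confirmed, "Confirmed").merge(
    to_narrow(wide_deaths, "Deaths"), on=["Country/Region", "Date"]
)
narrow = narrow.sort_values(["Country/Region", "Date"]).reset_index(drop=True)

# Daily values are the day-over-day difference of each running total.
# The first day of each country has no previous row, so it keeps its total.
for col in ["Confirmed", "Deaths"]:
    daily = narrow.groupby("Country/Region")[col].diff()
    narrow[f"New {col}"] = daily.fillna(narrow[col]).astype(int)

print(narrow)
```

Grouping by country before calling `diff()` keeps the first date of one country from being subtracted from the last date of the previous one.
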
This is used as ETL for a Power BI COVID-19 visualization.
Clone or download this repo. Python 3.7+ is required.
Install requirements:
pip install -r requirements.txt
Run the main script, specifying the output path:
python main.py -output "global.csv"