This repository contains a set of scripts for retrieving the United Nations Treaty Collection and transforming the retrieved data. Please open an issue if you find any errors or would like to suggest a feature. Pull requests are welcome!

Getting the data

You can get the data by cloning (see below) or downloading this repository, or you can scrape it yourself using un_data.py, which requires pandas, requests, and beautifulsoup4. These dependencies are listed in requirements.txt. The script generates index.csv, a treaties folder containing the treaty data itself, and a declarations folder containing declarations made by participants.

git clone git@github.com:zmjones/untreaties.git
cd untreaties
pip install -r requirements.txt
python un_data.py
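
After the scraper finishes, a quick sanity check of its output can be useful. The following is a minimal sketch, not part of this repository: the helper name summarize_output is hypothetical, and it assumes only that index.csv sits next to the treaties and declarations folders and that those folders hold files directly. It uses the standard library, so it makes no assumptions about the column names in index.csv.

```python
import csv
from pathlib import Path


def summarize_output(root="."):
    """Report what the scraper wrote under `root` (hypothetical helper).

    Assumes index.csv, treaties/ and declarations/ live directly in `root`.
    """
    root = Path(root)

    # Read the header and count the remaining rows of index.csv.
    with (root / "index.csv").open(newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        header = next(reader, [])
        n_rows = sum(1 for _ in reader)

    def count_files(folder):
        # A missing folder counts as zero files instead of raising.
        path = root / folder
        if not path.is_dir():
            return 0
        return sum(1 for entry in path.iterdir() if entry.is_file())

    return {
        "index_columns": header,
        "index_rows": n_rows,
        "treaty_files": count_files("treaties"),
        "declaration_files": count_files("declarations"),
    }


if __name__ == "__main__":
    # Run from the repository root after `python un_data.py`.
    print(summarize_output("."))
```

A count of zero for either folder suggests the scrape did not complete.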

Transforming the data

The R script utilities.R contains a number of functions that make working with the raw data easier. You can load these functions by sourcing the file, which requires the stringr, lubridate, and plyr packages. The loadData function loads a specific treaty given its chapter and treaty numbers, both passed as strings. Setting expand = TRUE expands the column names. If you expand the column names, you can also set panel = TRUE to transform the data into a country-year format, given a start year (syear) and an end year (eyear), both passed as strings.

source("utilities.R")
loadData(chap = "10", treaty = "2")
loadData(chap = "10", treaty = "2", expand = TRUE)
loadData(chap = "10", treaty = "2", expand = TRUE, panel = TRUE, syear = "1945", eyear = "2013")
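
To illustrate what a country-year format means, here is a small conceptual sketch in Python. It is not how utilities.R implements the panel option: the function name to_country_year, the input shape (participant name mapped to a ratification date), the ratified flag, and the example values are all hypothetical and chosen only to show the idea of expanding one record per participant into one row per participant per year.

```python
from datetime import date


def to_country_year(records, start_year, end_year):
    """Expand one record per participant into one row per participant-year.

    `records` maps a participant name to a ratification date, or None if
    the participant never ratified. Input shape and field names are
    hypothetical, for illustration only.
    """
    rows = []
    for participant, ratified_on in sorted(records.items()):
        for year in range(start_year, end_year + 1):
            # Flag is 1 from the ratification year onward, 0 before it.
            ratified = ratified_on is not None and ratified_on.year <= year
            rows.append(
                {"participant": participant, "year": year, "ratified": int(ratified)}
            )
    return rows


# Made-up example input, not real treaty data.
example = {"Country A": date(1950, 3, 1), "Country B": None}
panel = to_country_year(example, 1949, 1951)
for row in panel:
    print(row)
```

Each participant contributes one row for every year from the start year to the end year inclusive, which is the shape most panel-data tools expect.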