google_covid19_mobility_reports_scraper

Scrapes the data from ALL the PDF files from Google's Mobility Report: https://www.google.com/covid19/mobility/

Tries to make the data machine readable.

More to come.

just give me the data

Two formats, will be expanded with multiple dates:

cd pdf
for file in *.pdf; do pdftotext -layout "$file" "$file.txt"; done

Make sure to use the -layout option. More details here: https://www.xpdfreader.com/pdftotext-man.html

OSX

brew install poppler

Ubuntu/Debian

sudo apt-get install libpoppler-cpp-dev

Conda/Windows

conda install -c conda-forge poppler

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
pdf		pdf
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
country_dict.json		country_dict.json
fetch_pdfs.sh		fetch_pdfs.sh
scrape.py		scrape.py