Skip to content

Commit 9a77bde

Browse files
committed
First README
1 parent d22ecd3 commit 9a77bde

File tree

1 file changed

+21
-0
lines changed

1 file changed

+21
-0
lines changed

README.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,21 @@
1+
## U.S. Census Bureau commuting pattern parser
2+
3+
This takes commuting data from the U.S. Census Bureau, summarizes it to county or county-like geographies, then tries to build a couple useful reports out of it. Counties are handled by FIPS codes, a five-character identification system.
4+
5+
### county-highlights.json
6+
For a given county, what counties are important to it?
7+
8+
This is generated using `get-commuting-data.ipynb`. Supporting counties are shown in rough order of importantance, if they either represent at least 10 percent of the workforce in that county, or that county supplies at least 20 percent of its workforce (e.g., bedroom communities).
9+
10+
You can directly access that file at https://raw.githubusercontent.com/stucka/commuting-patterns/master/county-highlights.json
11+
12+
### us-county-commuters.json
13+
This contains data for every county that supports another, even if it's a single employee. Don't ignore the 'meta' tree.
14+
15+
### get-commuting-data.ipynb
16+
A Jupyter Notebook that downloads and processes the data. You may not need to run this, unless you want to change the cutoff points for the file you're probably looking for, `county-highlights.json`
17+
18+
This will download about 1gb of GZIPped files from the U.S. Census Bureau; combine them in an awful way to create about 8gb of data; and then begin processing it to create the above files. This takes a while.
19+
20+
### parsed folder
21+
Within the `parsed` folder are state-level reports, including the total number of commuters and total number of workers from each county. You're even less likely to need this.

0 commit comments

Comments
 (0)