Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Normalize addresses #17

Open
waldoj opened this issue Jun 25, 2013 · 2 comments
Open

Normalize addresses #17

waldoj opened this issue Jun 25, 2013 · 2 comments

Comments

@waldoj
Copy link
Member

waldoj commented Jun 25, 2013

Address data does not appear to be normalized. For instance, Virginia Engineers PAC's 09/07/2012 contribution says that their primary place of business is "Richmon, VA." (To be fair, that's not address data.) It appears that the software used by many committees is providing normalization, but they normalize differently. For instance, some software normalizes on long street suffixes ("Court," "Boulevard," "Road," etc.), while some software normalizes on short street suffixes ("Ct.," "Blvd.," "Rd.," etc.) So the good news is that reports often have internal consistency that should make it easy to join all of the reports in collective consistency.

Implement the an address normalization system (presumably the USPS's API) to deal with this problem.

The only question is at what point this should be done. Is it appropriate to do this prior to saving the data and generating the JSON? Or is it wrong to alter the SBE's data? Wouldn't this mean making tens of thousands of API calls every time that the parser is run?

This might be an argument for standardizing addresses via a cruder, local function at the time of input, and save the USPS API calls to be used beyond the Saberva pipeline.

waldoj added a commit that referenced this issue Jun 29, 2013
Towards #17. This is not as good as interfacing with the USPS's API,
which can deal with common address errors (e.g., "Richmon" ->
"Richmond"), but instead merely standardizes "Virginia" as "VA,"
"Street" as "St.," etc.
@waldoj
Copy link
Member Author

waldoj commented Oct 3, 2013

I signed up for the USPS API months ago, getting as far as the part where you wait for approval. I never heard back. So that ain't gonna happen for people.

@waldoj
Copy link
Member Author

waldoj commented Nov 4, 2014

The Census Bureau's geocoding API might be the way to do this.

waldoj added a commit that referenced this issue Nov 4, 2014
It’s not going to happen. Per #17.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant