Issue #59 - Inflation Adjustment #157

JennEYoon · 2019-11-01T01:40:18Z

Ready for your review of PR for Issue #59.

Verified that all ltdb variables that are affected by inflation adjustment are included in inflate_cols and the named strings match data from ltdb column of variables.csv. Function file data/data.py, line 494. I have checked every row of variables.csv file. There are no new variables for inflation adjustment. There are 8 variables that are affected by inflation adjustment.
Copied similar logic from ltdb inflate_cols into "store_ncdb" function definition. Use data from ncdb column in variables.csv. In data/data.py, see lines 678 - 701. Only 3 rows related to inflation adjustment have named strings in ncdb column. I am not sure if this logic works as desired. I added a definition for year, which is used in the last 2 lines.
Corrected several typos in documentation string part of data.py. ("dataframe" and "instantiation")

Thank you! Jennifer Yoon

knaaptime

thanks for getting this started @JennEYoon

a few things before we can get this merged

knaaptime · 2019-11-04T22:39:55Z

geosnap/data/data.py

@@ -675,6 +675,31 @@ def store_ncdb(filepath):

    df = df.set_index("geoid")

+    #### Beginning of New Code ####


can you remove these comments?

knaaptime

can you delete the temp directory?

knaaptime · 2019-11-04T23:37:31Z

geosnap/data/data.py

+    year = df["year"]
+
+    # Inflation logic for ncdb.
+    inflate_cols = [
+        "MDVALHS",
+        "MDGRENT",
+        "MDHHY",
+    ]
+    # Five rows have missing ncdb labels in variables.csv.
+    # per capita income, missing
+    # median household income white, missing
+    # median household income black, missing
+    # median household income hispanic, missing
+    # median household income asian, missing
+
+    inflate_available = list(set(df.columns).intersection(set(inflate_cols)))
+
+    if len(inflate_available):
+        df = adjust_inflation(df, inflate_available, year)
+    return df
+
+    #### End of New Code ####


this will need to work a little bit differently from the ltdb example. Unlike in the store_ltdb function, we've only got a single dataframe here, so we need to slice the ncdb dataframe by year and apply the adjust_inflation to each slice

Ok, I will take a look.

sjsrey · 2020-04-14T19:34:24Z

Closing as this is now covered in #216

JennEYoon added 3 commits October 31, 2019 21:04

Update data.py inflate_cols fuction

a2aa817

Fix one typo.

696f518

Remove 1 trailing white space

4adbb29

knaaptime requested changes Nov 4, 2019

View reviewed changes

knaaptime reviewed Nov 4, 2019

View reviewed changes

knaaptime requested changes Nov 4, 2019

View reviewed changes

sjsrey closed this Apr 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue #59 - Inflation Adjustment #157

Issue #59 - Inflation Adjustment #157

JennEYoon commented Nov 1, 2019

knaaptime left a comment

knaaptime Nov 4, 2019

JennEYoon Nov 5, 2019

knaaptime left a comment

knaaptime Nov 4, 2019

JennEYoon Nov 5, 2019

sjsrey commented Apr 14, 2020

		@@ -675,6 +675,31 @@ def store_ncdb(filepath):

		df = df.set_index("geoid")

		#### Beginning of New Code ####

Issue #59 - Inflation Adjustment #157

Issue #59 - Inflation Adjustment #157

Conversation

JennEYoon commented Nov 1, 2019

knaaptime left a comment

Choose a reason for hiding this comment

knaaptime Nov 4, 2019

Choose a reason for hiding this comment

JennEYoon Nov 5, 2019

Choose a reason for hiding this comment

knaaptime left a comment

Choose a reason for hiding this comment

knaaptime Nov 4, 2019

Choose a reason for hiding this comment

JennEYoon Nov 5, 2019

Choose a reason for hiding this comment

sjsrey commented Apr 14, 2020