You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Right now there are 2 cases when making a histogram for a categorical variable:
The user enters a list of bins, and the laplace mechanism is used
The user does NOT enter a list of bins, and the stability mechanism is used
We want to implement a third case:
3) the user enters a list of bins, but the list is a subset of the full list of levels the variable takes. So we add an NA bins to the list of bins, set all levels that were not entered in the list of bins to NA, and then use the stability mechanism
In implementing this third case, we will use the existing histogramCategoricalBins function in utilities-histogram.R
The text was updated successfully, but these errors were encountered:
Ira and I discussed this at length, and we decided this issue should be tabled for now.
Given the way the library is structured now, where there are export() statements in the statistics to call the mechanisms, there is no logical way to set a local attribute in a subclass and the check for its existence.
We plan to do major restructuring of the library to have the mechanisms and statistics be completely separate entities, and in this case it will be more possible to set impute as an attribute of only the histogram statistic.
When the library is restructured, we can revisit the issue of conditioning the call to fillMissing() on impute for the histogram statistic.
MeganFantes
changed the title
Add NA bin to histogram when user enters a list of histogram bins that are a subset of all variable levels
Add NA bin to histogram when user enters a list of histogram bins that are a subset of all variable levels (tabled until library restructured)
Aug 21, 2019
Right now there are 2 cases when making a histogram for a categorical variable:
We want to implement a third case:
3) the user enters a list of bins, but the list is a subset of the full list of levels the variable takes. So we add an NA bins to the list of bins, set all levels that were not entered in the list of bins to NA, and then use the stability mechanism
In implementing this third case, we will use the existing
histogramCategoricalBins
function inutilities-histogram.R
The text was updated successfully, but these errors were encountered: