Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

reform hh parsing pipeline #11

Open
chunhochow opened this issue Jul 23, 2024 · 0 comments
Open

reform hh parsing pipeline #11

chunhochow opened this issue Jul 23, 2024 · 0 comments

Comments

@chunhochow
Copy link
Member

For 2022/, it seems like only the raw hh file is used in all of the summary notebooks in 04b-summary_notebooks (via load_hh_raw() in utils). If this is indeed the case (to be verified) and the processed hh files aren't used, should we produce so many iterations/versions of the hh file as it goes through the different steps of the processing pipeline? This seems to just be adding to the cognitive load and confusion of what exactly the files are after each specific step of the pipeline.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant