Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Function to download history data #56

Merged
merged 10 commits into from
Apr 24, 2022
Merged

Function to download history data #56

merged 10 commits into from
Apr 24, 2022

Conversation

llrs
Copy link
Contributor

@llrs llrs commented Feb 28, 2022

This is a draft after some iterations on the function (I originally used bash).
It depends on the number of columns on the file to simplify and merge the different files.

I hope it is clear enough (but maybe it could be simplified with a helper function).
It also iterates two times through the list of .csv files, that could be reduced but increasing the memory required earlier. (I don't think this is an issue and it could help if column names are reordered).
It doesn't check if data was recently downloaded or cache the data.

Let me know how do you see it and I'll finish writing the documentation.

@llrs llrs marked this pull request as ready for review March 2, 2022 13:38
@Bisaloo
Copy link
Member

Bisaloo commented Mar 10, 2022

Before I go into the actual review, I have a couple of questions for you:

  • Do you think we should reformat older files to make sure all files use the same format?
  • Should we remove the non-csv files from the history branch? I believe cran-diagram.png was added by mistake and Hadley's analysis.R could be promoted to the main branch alongside your function to make it more discoverable.

@llrs
Copy link
Contributor Author

llrs commented Mar 10, 2022

Good questions.

  • I don't like to tamper with original files. I would leave them as they are. Later on in a couple of years maybe the function could be modified to load data from a certain date (or last X period) which would exclude these files.
  • That might be useful but they aren't much of a problem. I don't think the package is the right place to share analysis code on an article or branch. But sharing a minimal insight from the data collected might be useful for users of the dashboard (I would report numbers but hide or fold the code used).

@Bisaloo Bisaloo requested review from maelle and Bisaloo March 29, 2022 09:55
Copy link
Member

@Bisaloo Bisaloo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me.

A couple of points to update:

  • I'd like more comments about the various formats and when they were in use. I'll submit commit suggestions for this later
  • Could you rename the file to download_history.R please?
  • Could you add a roxygen title?
  • Could you add a roxygen comment with #' @importFrom utils download.file read.csv unzip?
  • Once the two points above have been addressed, could you run devtools::document()?

I'm happy to post commit suggestions or do the changes myself if convenient. Please let me know ☺️

@llrs
Copy link
Contributor Author

llrs commented Apr 5, 2022

Sorry, I almost forgot about this.
I'll do the changes and update the documentation (probably this week). But if you want to provide suggestion on the comments/dates go ahead.

@llrs
Copy link
Contributor Author

llrs commented Apr 6, 2022

I think I addressed all the point raised. I took the liberty to use desc::desc_normalize to arrange the order of dependencies after adding utils.

@llrs
Copy link
Contributor Author

llrs commented Apr 20, 2022

Gentle(?) reminder @Bisaloo and @maelle (I just remember about this and wasn't sure if I had updated the PR or not)

Copy link
Member

@Bisaloo Bisaloo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh, I missed the fact you added comments with the dates so I wrote my own. I tried to merge what we both did. Please let me know what you think.

One final comment: I just realized now you're not in the authors of this package. Could you add yourself as aut please?

@llrs
Copy link
Contributor Author

llrs commented Apr 22, 2022

👍 I fixed the comments, reordered the code and I added myself as an author

@Bisaloo Bisaloo linked an issue Apr 24, 2022 that may be closed by this pull request
@Bisaloo Bisaloo merged commit 29dc28e into r-hub:main Apr 24, 2022
@Bisaloo
Copy link
Member

Bisaloo commented Apr 24, 2022

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Function to analyze history branch/data
2 participants