-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docs details #2690
Docs details #2690
Conversation
verbose option has been removed in df94a7c Now there is no easy way to remove the progress bar. Using the hack in #2651 (comment) would make the code snippet too complicated.
Thanks for all the comments and for the corrections in the docs ! About all the points you mentioned:
Yes good idea
It refers to examples scripts inside the git repository of the library, see the
This is outdated and must be replaced by
We can replace the
Let's add
Currently there's no documentation for the CSV loader config. Maybe we can add the docstrings to the
This is the same as in
The function
Yes good idea !
Sure why not. Moreover the csv loader now supports remote files so you could just run the code pass an an URL to the sample csv file.
This can be used for distributed processing or just to use a percentage of the data. We can definitely give example of use cases
|
Co-authored-by: Quentin Lhoest <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the corrections.
Though not all your comments have been addressed in this PR, we can already merge it.
As you may know we'll have a new documentation soon anyway ;)
Some comments here:
pip install datasets transformers torch 'datasets[streaming]'
(maybe just link to https://huggingface.co/docs/datasets/installation.html + a one-liner that installs all the requirements / alternatively a requirements.txt file)data_dir
disable_progress_bar
, see https://huggingface.co/docs/datasets/search.html?q=disable_progress_bar, even if the code contains docstrings. Does it mean that the function is not officially supported? (otherwise, maybe it also deserves a mention in https://huggingface.co/docs/datasets/package_reference/logging_methods.html)training_args
has not been defined before in the doc.