feat(dataset): dataset as config pilot code #47
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Resolves #46
Depends on #75
A pilot study of dataset as configs.
Background
#15
#46
Experiment
In this PR, we implemented a small example of how to define dataset using a
yaml
file. In this example, we provided a yaml filedatasets/minimal.yaml
,hamilflow/datasets/miminal.yaml
Lines 1 to 11 in 96ca800
When we run a command
we can save the dataset in a specified location.
A few things are ignored in this example:
How to Improve this Minimal Example
!
in the keys. We can implement a better way to define a model using customized data.dataset = genenrate_dataset(path_to_config)
A few questions