-
Notifications
You must be signed in to change notification settings - Fork 258
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unify example dataset in configs #916
Comments
There are memory and perf implications with using one or the other, I think some were purposely set to cleaned or normal alpaca but not sure, cc @rohan-varma |
I think @ebsmothers has more context on this, IIUC some were set to clean when we were trying to replicate existing studies. |
Personally I would just recommend making cleaned the default as that's what most folks seem to have converged on |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
We use alpaca cleaned and alpaca. Should just use one for our example datasets.
The text was updated successfully, but these errors were encountered: