Can I implement online learning on a big CSV Dask dataset? #938
Answered
by
MaxHalford
jorgesisco
asked this question in
Q&A
-
I have 80 csv files that I loaded them all in one dataframe using Dask, after transforming the data using onehotencoding, I get a really huge dataset impossible to train even with large instances on amazon sagemaker. I am trying to find an alternative by training the model with partial data but using the whole dataset at the end, is this possible? Any advice would be welcome 🙏 |
Beta Was this translation helpful? Give feedback.
Answered by
MaxHalford
May 24, 2022
Replies: 1 comment 4 replies
-
Hello. Yes, I would say it's possible with River. What is blocking you? Have you tried using River? |
Beta Was this translation helpful? Give feedback.
4 replies
Answer selected by
MaxHalford
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hello. Yes, I would say it's possible with River. What is blocking you? Have you tried using River?