-
Notifications
You must be signed in to change notification settings - Fork 164
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About training vae for discrete-valued data #150
Comments
Hello @Melon-Xu, Thank you for the kind words. Can you try to rescale the values in the range [0, 1]? Best, Clément |
Thank you for your reply. I will try it. |
Hello @Melon-Xu Thanks for sharing those results. It seems indeed that your data is very sparse (only a few points are non-zero) which may explain the loss value. Can you share the encoding-decoding architecture you are currently using? I have also experienced that VAE with simple architectures may struggle with this type of data when training a VAE on face landmarks. Another approach would consist in changing the data a bit to make them look like a continuous distribution by smoothing the values near the non-zero points (to make it look like a mixture of Gaussian, for instance). This would make the task easier, and at prediction time, you can postprocess the output of the VAE to get the final points. I hope this helps, Clément |
Thank you for your excellent work! It is really helpful and convenient to use.
I have some image data with discrete pixel values, for example, only 0 and 255. I tried training a VAE on it but the loss is really huge and the reconstruction is not that good.
Do you have any suggestions on training VAE for this kind of data, like model selection, and hyper-parameter selections?
Thank you very much!
The text was updated successfully, but these errors were encountered: