Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Replicating the bits / dim results in the original VQ-VAE paper #1

Open
pclucas14 opened this issue Jun 11, 2019 · 7 comments
Open

Comments

@pclucas14
Copy link

Hi,

thanks for your clean implementation ! I was wondering, have you ever tried to calculate the bits / dimension metric (as in the original paper) ? I've tried to do so using the provided code, and I'm still quite far from the results in the paper. I was hoping maybe you would have some insight to share as to why that is the case.

Thanks!
Lucas

@kashif
Copy link
Contributor

kashif commented Jun 15, 2019

@pclucas14 I haven't tried to do that but plan to... give me a few days to figure it out... I am trying to compile a table of bits/dim for MNIST/ Fashion-MNIST so I might as well start with this

@pclucas14
Copy link
Author

great! let me know what you end up getting :)

@kashif
Copy link
Contributor

kashif commented Jun 15, 2019

Thanks! Here is where I am collecting the results: zalandoresearch/fashion-mnist#144

@pclucas14
Copy link
Author

great! it's a very complete list, pretty cool :D

@pclucas14
Copy link
Author

The best I got for VQ-VAE on CIFAR-10 was 5.00 BPD (using a unigram prior). This is still far from the results in the VQ-VAE paper. Hopefully you can manage to close the gap even more :)

@pvitoria
Copy link

Hi @kashif ,
could you provide the code you use to compute the bpd?

Thanks!

@podlipensky
Copy link

Wonder if there is code for proper bpd computation? If so, can you please share?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants