-
Notifications
You must be signed in to change notification settings - Fork 177
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Log softmax vs softmax #87
Comments
You can configure the PyTorch loss function to take log of targets or just targets. By default the targets are not in log-space and so this is what I used. There may be numerical stability benefits but honestly I don’t remember if there was some rationale behind this. There are examples of both in the docs: https://pytorch.org/docs/stable/generated/torch.nn.KLDivLoss.html |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
WhisperSpeech/whisperspeech/vq_stoks.py
Line 344 in 80b268b
Why use log softmax on the model logits, but softmax on the teacher logits?
The text was updated successfully, but these errors were encountered: