-
Notifications
You must be signed in to change notification settings - Fork 263
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
argmax calculation on the GPU? #367
Comments
Hi Timo, |
Hi Nikolaev, |
Hi @TimoSaemann |
Hi @drnikolaev |
Hi @TimoSaemann it's on my plate but there are few urgent bugs to fix before the next release, sorry about this. |
Hi,
I have found that the argmax calculation is a bottleneck. While the forward pass of my net needs just 18 ms, the argmax calculation needs about 40 ms (1024x544 px). I'm sure you could speed up this calculation if you calculate it on the GPU. Would it be possible for you to implement an argmax.cu? That would be really helpful! Thanks!
Best,
Timo
The text was updated successfully, but these errors were encountered: