-
Notifications
You must be signed in to change notification settings - Fork 242
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement RoIAlign #3493
base: develop
Are you sure you want to change the base?
Implement RoIAlign #3493
Conversation
@@ -39,3 +39,4 @@ The MIOpen API library is structured as follows: | |||
* :doc:`ReLU <../doxygen/html/group___re_l_u>` (experimental) | |||
* :doc:`Kthvalue <../doxygen/html/group__kthvalue>` (experimental) | |||
* :doc:`GLU <../doxygen/html/group__glu>` (experimental) | |||
* :doc:`RoIAlign <../doxygen/html/group__RoIAlign>` (experimental) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
* :doc:`RoIAlign <../doxygen/html/group__RoIAlign>` (experimental) | |
* :doc:`RoIAlign <../doxygen/html/group___ro_i_align>` (experimental) |
@anhskrttt Do we need a changelog entry for this addition? |
@amd-jnovotny I actually couldn't find any MIOpen documentation specifying which types of additions require a changelog entry...? I noticed that similar ops, such as cc @long10024070 Do you see any issues with adding this op to the changelog? |
@amd-jnovotny I think editing changelog should be totally handled from AMD side. It is not our decision. |
@long10024070 thanks for the feedback. @BrianHarrisonAMD Should we add an entry to the MIOpen changelog? Maybe as part of a separate PR? If you can add something, I can review it. |
RoIAlignForward
, MIOpen performs better if input and output tensors are contiguous.RoIAlign
doesn't support forbfloat16
yet, so this is an always winning case.RoIAlignBackward
, MIOpen performs better if:float16
typeinput_numel
size is smaller)Average improvement over ROCm
Detail Benchmark
fp32_fwd
fp16_fwd
fp16_bwd