Skip to content
This repository has been archived by the owner on Jan 26, 2022. It is now read-only.

a question about the time consumed by copying tensor from cpu to gpu #215

Open
qimw opened this issue Apr 24, 2019 · 1 comment
Open

a question about the time consumed by copying tensor from cpu to gpu #215

qimw opened this issue Apr 24, 2019 · 1 comment

Comments

@qimw
Copy link

qimw commented Apr 24, 2019

Hi all, I am trying to add lighthead module on faster rcnn. I add a new light_branch on faster_rcnn_heads.py which conbines the functions of box_head and box_out. This module needs to move rois in rpn_return from cpu to gpu but I find this operation much slowly than the roi_feature_transform in module_builder.py (0.04s vs <1e-3s).
Then I move these operations on module_builder.py, and the time is much less, but the time of get restore_bl increased to 0.04s.
This question puzzled me for a long time and could any one figure it out?

@controny
Copy link

I am also puzzled by this problem...
Have you figured it out yet?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants