Sequential processing of video frames in backbone #73

xserban · 2022-05-26T06:59:51Z

I have a question regarding your implementation, specifically the way you pass raw data to the backbone.
If the input is a video with 36 frames, how are they being processed by a ResNet-50/101?

In the BackboneBase class, forward method, you are passing the tensor list to a backbone, but the backbone is expecting an input of size [64, 3, 7, 7]. As I understand from the paper you are reshaping the videos to [36x300x540] .. so there should be one more preprocessing step from the videos to the backbone.

Can you shed some light on this extra step?

Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sequential processing of video frames in backbone #73

Sequential processing of video frames in backbone #73

xserban commented May 26, 2022 •

edited

Loading

Sequential processing of video frames in backbone #73

Sequential processing of video frames in backbone #73

Comments

xserban commented May 26, 2022 • edited Loading

xserban commented May 26, 2022 •

edited

Loading