Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about more frames #30

Open
Debrove opened this issue Oct 27, 2023 · 6 comments
Open

Question about more frames #30

Debrove opened this issue Oct 27, 2023 · 6 comments

Comments

@Debrove
Copy link

Debrove commented Oct 27, 2023

Hi! Sorry to bother you again~

I tried to input more frames (16 frames instead of 8 frames) on the ImageNet-pre-trained r50 model, but it resulted in worse results (40.5 mAP and 51.6 NDS). I just set the num_frames = 16 in the config. Is that something wrong?

Looking forward to your reply! Thank you!

@afterthat97
Copy link
Collaborator

It's strange. I didn't try more than 8 frames, but it should improve the performance. I will have a try when I have free GPUs.

@afterthat97
Copy link
Collaborator

By the way, why don't you use the R50 pretrained on nuImg? It's more stable.

@Debrove
Copy link
Author

Debrove commented Oct 27, 2023

By the way, why don't you use the R50 pretrained on nuImg? It's more stable.

Thanks for your reply. I will have a try.

@Debrove
Copy link
Author

Debrove commented Oct 28, 2023

By the way, why don't you use the R50 pretrained on nuImg? It's more stable.

Hi~ I ran the R50 pretained on nuImg (configs). I obtained the baseline results (44.7 mAP and 55.5 NDS) and the results of 16 frames (42.6 mAP and 53.1 NDS). 🤔️

@afterthat97
Copy link
Collaborator

I need to run a few experiments to find out why, thanks for your patience!

@Gigalomanicx
Copy link

Hi @Debrove @afterthat97 , I'm also working on this work. in my opinion, 16 frames cover a fairly long span of around 8 seconds in the NuScenes dataset, which raises concerns about the validity of aligning object motion with ego motion. Additionally, it’s uncertain whether the same object can be consistently captured over this time window. For instance, an object in the front view might remain visible for a longer period, whereas this wouldn’t necessarily be the case for side views.

image
Example 1: As the frame increases, the prediction of offsets for sampling point locations seems to become increasingly inaccurate.
image
Example 2: At a corner with an occlusion relation, the occlusion relation seems to be ignored for the pedestrian sampling points in the pre-frame.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants