Pretrained models performing poorly on dense video captioning #22
Comments
I have the same issue with the HowTo100M + VidChapters-7M + YouCook2 model. For this video the model gives these captions:
while for this video it gives:
The HowTo100M + VidChapters-7M + ViTT model is performing poorly on dense video captioning.
Reproduction:
Run
to download this specific video.
Follow the steps in the demo using the HowTo100M + VidChapters-7M + ViTT checkpoint.
Output captions: