Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

custom dataset #7

Open
maherr13 opened this issue Mar 13, 2022 · 8 comments
Open

custom dataset #7

maherr13 opened this issue Mar 13, 2022 · 8 comments

Comments

@maherr13
Copy link

Are there procedures/steps/scripts for training the model on a custom dataset?

@ShenhanQian
Copy link
Owner

You can follow the procedures/scripts of Speech2Gesture .

@maherr13
Copy link
Author

Thanks that was helpful, may I ask How to calculate scaling constants for my dataset like in [speakers_stat.py? @ShenhanQian

@ShenhanQian
Copy link
Owner

The scale_factor is used to resize the keypoints of subjects so that they have equal widths of shoulders.

Therefore, you can first compute the mean shoulder width of your own subject, and obtain its scale_factor by its ratio to the mean shoulder width of either subject in our released data.

@ShenhanQian
Copy link
Owner

ShenhanQian commented Mar 20, 2022

I have just uploaded reference scripts for custom data processing.

You can find the mean shoulder width of Oliver here.

oliver_shoulder_dist = 331.0850066245443

Alternatively, you could simply rescale the keypoints with data_preprocess/2_3_rescale_shoulder_width.py so that the new speaker will have the same shoulder width as Oliver, then you can set the new speaker's scale_factor to Oliver's.

@maherr13
Copy link
Author

maherr13 commented Mar 22, 2022

I would like to thank you for the scripts, it's really helpful.

I would like to mention two errors in 2_1_gen_kpts.py :

  1. at line 47 opWrapper.emplaceAndPop takes op.VectorDatum object type so I corrected it to opWrapper.emplaceAndPop(op.VectorDatum([datum]))

  2. at line 82 I think I missed the bracket location I corrected it to len(sys.argv) == 2

@maherr13
Copy link
Author

I tried the pipeline and it worked perfectly fine except that I got the mean and std values for the key points equal to zeros from the last two scripts.

any thoughts why could that happen?

@Garfield-Finch
Copy link
Collaborator

Thank you for your support!
Could you please share your ``clips.csv'' (or a segment of it) to help us debug?

@maherr13
Copy link
Author

sorry for late response,

image

the data I ran through the pipeline was 6 videos each 10-sec max for the same person.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants