custom dataset #7

maherr13 · 2022-03-13T19:27:24Z

Are there procedures/steps/scripts for training the model on a custom dataset?

ShenhanQian · 2022-03-14T01:30:39Z

You can follow the procedures/scripts of Speech2Gesture .

maherr13 · 2022-03-16T11:21:27Z

Thanks that was helpful, may I ask How to calculate scaling constants for my dataset like in [speakers_stat.py? @ShenhanQian

ShenhanQian · 2022-03-17T02:30:18Z

The scale_factor is used to resize the keypoints of subjects so that they have equal widths of shoulders.

Therefore, you can first compute the mean shoulder width of your own subject, and obtain its scale_factor by its ratio to the mean shoulder width of either subject in our released data.

ShenhanQian · 2022-03-20T12:31:27Z

I have just uploaded reference scripts for custom data processing.

You can find the mean shoulder width of Oliver here.

SpeechDrivesTemplates/data_preprocess/2_3_rescale_shoulder_width.py

Line 68 in 1d8182b

oliver_shoulder_dist = 331.0850066245443

Alternatively, you could simply rescale the keypoints with data_preprocess/2_3_rescale_shoulder_width.py so that the new speaker will have the same shoulder width as Oliver, then you can set the new speaker's scale_factor to Oliver's.

maherr13 · 2022-03-22T17:37:42Z

I would like to thank you for the scripts, it's really helpful.

I would like to mention two errors in 2_1_gen_kpts.py :

at line 47 opWrapper.emplaceAndPop takes op.VectorDatum object type so I corrected it to opWrapper.emplaceAndPop(op.VectorDatum([datum]))
at line 82 I think I missed the bracket location I corrected it to len(sys.argv) == 2

maherr13 · 2022-03-22T17:43:57Z

I tried the pipeline and it worked perfectly fine except that I got the mean and std values for the key points equal to zeros from the last two scripts.

any thoughts why could that happen?

Garfield-Finch · 2022-03-23T13:43:57Z

Thank you for your support!
Could you please share your ``clips.csv'' (or a segment of it) to help us debug?

maherr13 · 2022-03-31T12:57:16Z

sorry for late response,

the data I ran through the pipeline was 6 videos each 10-sec max for the same person.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

custom dataset #7

custom dataset #7

maherr13 commented Mar 13, 2022

ShenhanQian commented Mar 14, 2022

maherr13 commented Mar 16, 2022

ShenhanQian commented Mar 17, 2022

ShenhanQian commented Mar 20, 2022 •

edited

Loading

maherr13 commented Mar 22, 2022 •

edited

Loading

maherr13 commented Mar 22, 2022

Garfield-Finch commented Mar 23, 2022

maherr13 commented Mar 31, 2022

custom dataset #7

custom dataset #7

Comments

maherr13 commented Mar 13, 2022

ShenhanQian commented Mar 14, 2022

maherr13 commented Mar 16, 2022

ShenhanQian commented Mar 17, 2022

ShenhanQian commented Mar 20, 2022 • edited Loading

maherr13 commented Mar 22, 2022 • edited Loading

maherr13 commented Mar 22, 2022

Garfield-Finch commented Mar 23, 2022

maherr13 commented Mar 31, 2022

ShenhanQian commented Mar 20, 2022 •

edited

Loading

maherr13 commented Mar 22, 2022 •

edited

Loading