Hi,
My partner and I are conducting a research project on developing a speech emotion recognition (SER) model for New Zealand English. We evaluated the model you provided here and achieved promising results, but we would like to fine-tune it on another corpus.
We were wondering what input format the model expects our dataset to be in for training. Our data is a Dataset object built with the datasets library from Hugging Face, and the debug console in the image below shows its structure. Each example currently holds the audio together with arousal and valence annotations as inputs to the model.
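In case the image does not load, here is a rough sketch of the shape of one example, written with plain Python containers so it runs without any dependencies. The column names `audio`, `arousal`, and `valence`, the 16 kHz sampling rate, the label values, and the helpers `make_example` and `check_example` are all assumptions made for illustration. The nested `path` / `array` / `sampling_rate` layout mirrors how the datasets library presents a decoded audio column.

```python
import math

# Hypothetical values throughout: the real columns are the ones in the screenshot.
SAMPLING_RATE = 16_000  # assumption: 16 kHz mono audio


def make_example(duration_s, arousal, valence):
    """Build one record shaped like a decoded audio row plus two labels."""
    n = int(duration_s * SAMPLING_RATE)
    # Placeholder 220 Hz tone standing in for real speech samples.
    samples = [math.sin(2 * math.pi * 220 * i / SAMPLING_RATE) for i in range(n)]
    return {
        "audio": {"path": None, "array": samples, "sampling_rate": SAMPLING_RATE},
        "arousal": float(arousal),
        "valence": float(valence),
    }


def check_example(example):
    """Return True if the record has the audio and label fields described above."""
    audio = example.get("audio", {})
    return (
        isinstance(audio.get("array"), list)
        and isinstance(audio.get("sampling_rate"), int)
        and isinstance(example.get("arousal"), float)
        and isinstance(example.get("valence"), float)
    )


# Two made-up records with arbitrary label values.
examples = [make_example(0.5, 0.7, 0.4), make_example(0.25, 0.2, 0.6)]
```

Whether the training code expects these exact keys, or something else such as a single stacked label vector, is exactly what we are unsure about.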
Was this the input used, or was a different input expected?