You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have just used the pyannote-segmentation model for Voice Activity Detection. I very much liked how I was able to evaluate the model's performance. Now, I am seeking a way to use the same model but then to reconstruct the speech segment outputs into an audio. This will be similar to using the VAD model to filter out non-speech portions from an audio. Any help on this is greatly appreciated!
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
I have just used the pyannote-segmentation model for Voice Activity Detection. I very much liked how I was able to evaluate the model's performance. Now, I am seeking a way to use the same model but then to reconstruct the speech segment outputs into an audio. This will be similar to using the VAD model to filter out non-speech portions from an audio. Any help on this is greatly appreciated!
Beta Was this translation helpful? Give feedback.
All reactions