Is there anyway we can get the (approximate) time/frame of the ouput #33
-
Given a clip of 5s speech, is there anyway to get the rough time when each word was spoken? |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments
-
There is an example for PyTorch producing timestamps in the colab |
Beta Was this translation helpful? Give feedback.
-
I see, thanks for pointing out. By turn on the |
Beta Was this translation helpful? Give feedback.
-
In this particular case basic lm is integrated into the model so it knows when words start and end Longer pronunciation to a reasonable extent should not be a problem since this was accounted for during training |
Beta Was this translation helpful? Give feedback.
There is an example for PyTorch producing timestamps in the colab