Hi @gormat,

This model does not extract punctuation marks and does not split into sentences, am I missing something?

In most cases, written punctuation is not related to how people actually speak (except perhaps for question marks).
In the CE version of our models we provide only the STT part.
Typically, though, the pipeline may work as follows:

  • You can use a VAD (voice activity detector) to split speech into utterances, since people typically separate ideas with pauses.
  • You can fine-tune a pre-trained LM, such as a transformer, to add capital letters, commas, full stops, etc.
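
To make the first step concrete, here is a minimal sketch of pause-based splitting. It is a toy energy-threshold detector standing in for a real VAD, not the VAD used with these models. The function name `split_on_pauses` and all parameter values (`frame_ms`, `energy_threshold`, `min_pause_ms`) are assumptions chosen for illustration.

```python
from typing import List, Sequence, Tuple


def split_on_pauses(
    samples: Sequence[float],
    sample_rate: int,
    frame_ms: int = 30,              # assumed frame size
    energy_threshold: float = 0.01,  # assumed mean-square energy cutoff
    min_pause_ms: int = 300,         # assumed minimum pause between utterances
) -> List[Tuple[int, int]]:
    """Return (start, end) sample indices of utterances separated by pauses."""
    frame_len = max(1, sample_rate * frame_ms // 1000)
    min_pause_frames = max(1, min_pause_ms // frame_ms)
    n_frames = (len(samples) + frame_len - 1) // frame_len

    utterances: List[Tuple[int, int]] = []
    start = None      # sample index where the current utterance began
    silent_run = 0    # consecutive silent frames seen inside an utterance

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        energy = sum(s * s for s in frame) / len(frame)
        if energy >= energy_threshold:
            if start is None:
                start = i * frame_len
            silent_run = 0
        elif start is not None:
            silent_run += 1
            if silent_run >= min_pause_frames:
                # Close the utterance at the first silent frame of the pause.
                end = (i - silent_run + 1) * frame_len
                utterances.append((start, end))
                start, silent_run = None, 0

    if start is not None:
        # Audio ended mid-utterance; trim any trailing silent frames.
        end = min((n_frames - silent_run) * frame_len, len(samples))
        utterances.append((start, end))
    return utterances
```

Each returned span can then be sliced out of the audio and passed to the STT model separately. A trained VAD will generally be more robust to noise than a fixed energy threshold.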

Is there a way to get start and end timestamps for each word as well?

  • Please use this Colab notebook
  • Proceed to the PyTorch example
  • Search for the comment # align example
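
For readers who want the general idea before opening the notebook, here is a rough sketch of one common way to derive word timestamps from per-frame CTC-style output. It is not necessarily what the notebook's align example does. The function name `ctc_word_timestamps`, the blank symbol `"_"`, and the fixed frame duration `frame_sec` are all assumptions for illustration.

```python
from typing import List, Sequence, Tuple


def ctc_word_timestamps(
    frame_labels: Sequence[str],
    frame_sec: float,   # assumed duration of one output frame, in seconds
    blank: str = "_",   # assumed CTC blank symbol
) -> List[Tuple[str, float, float]]:
    """Greedy CTC collapse that also tracks (word, start_sec, end_sec)."""
    words: List[Tuple[str, float, float]] = []
    chars: List[str] = []
    start = end = 0.0
    prev = blank

    def flush() -> None:
        # Emit the word collected so far, if any.
        if chars:
            words.append(("".join(chars), start, end))
            chars.clear()

    for i, label in enumerate(frame_labels):
        is_new = label != prev  # repeated labels collapse into one character
        prev = label
        if label == blank:
            continue
        if label == " ":
            flush()             # a space closes the current word
            continue
        if is_new:
            if not chars:
                start = round(i * frame_sec, 3)
            chars.append(label)
        end = round((i + 1) * frame_sec, 3)  # extend to the end of this frame

    flush()
    return words
```

For example, the frames `_ h h i _ (space) _ y o o _` collapse to the words "hi" and "yo", each with the time span of the frames that produced its characters.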

Answer selected by snakers4
This discussion was converted from issue #41 on January 25, 2021 12:05.