Skip to content
Discussion options

You must be logged in to vote

Hi,

We do not have public built-in streaming interface for our models for simplicity reasons.

You can try our VAD to split audio into chunks - https://github.com/snakers4/silero-vad

STT best works for 5-15s audio chunks anyway.

In case some chunk is longer, you may use an align method in the decoder, apply it to a fixed length chunk, split on some word and just run STT the second time on the subchunks.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@ZhengHe-MD
Comment options

Answer selected by snakers4
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
help wanted Extra attention is needed
3 participants
Converted from issue

This discussion was converted from issue #101 on October 08, 2021 03:29.