You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hey everyone! Firstly, thanks for your work and a great product!
I have a feature request about ability to configure BUFFER_SIZE in transcriber service which is currenlty only 500ms.
The use case is following: I don't need live captions in meetings, but I do need transcriptions.
What I'm trying to do is to use Jitsi Skynet for transcription with Faster-Whisper. And since I don't need live captions but only resulting transcription - it looks like transcribing input stream using 500ms chunks isn't optimal. I assume that increase of buffer size can result into less workload on whisper service which is very desirable. Correct me, if I'm wrong
Thanks in advance
The text was updated successfully, but these errors were encountered:
Hi, I will work on this when I will have some time on my hands. And btw, for the skynet transcriber the buffer size is ~1.2 seconds as the calculations in the participant class assume that the audio uses a 48k sampling rate everywhere, but skynet requires 16k. See
Yeah, you're right, buffer is bigger for whisper, but still too small and processed too frequent. I've confirmed much less workload on Skynet by modifying demo to use 5 seconds buffer
Hey everyone! Firstly, thanks for your work and a great product!
I have a feature request about ability to configure
BUFFER_SIZE
in transcriber service which is currenlty only500ms
.The use case is following: I don't need live captions in meetings, but I do need transcriptions.
What I'm trying to do is to use Jitsi Skynet for transcription with Faster-Whisper. And since I don't need live captions but only resulting transcription - it looks like transcribing input stream using
500ms
chunks isn't optimal. I assume that increase of buffer size can result into less workload on whisper service which is very desirable. Correct me, if I'm wrongThanks in advance
The text was updated successfully, but these errors were encountered: