Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Ability to configure BUFFER_SIZE for transcription #525

Open
miro-ku opened this issue Mar 7, 2024 · 2 comments
Open

Comments

@miro-ku
Copy link

miro-ku commented Mar 7, 2024

Hey everyone! Firstly, thanks for your work and a great product!

I have a feature request about ability to configure BUFFER_SIZE in transcriber service which is currenlty only 500ms.
The use case is following: I don't need live captions in meetings, but I do need transcriptions.
What I'm trying to do is to use Jitsi Skynet for transcription with Faster-Whisper. And since I don't need live captions but only resulting transcription - it looks like transcribing input stream using 500ms chunks isn't optimal. I assume that increase of buffer size can result into less workload on whisper service which is very desirable. Correct me, if I'm wrong

Thanks in advance

@rpurdel
Copy link
Contributor

rpurdel commented Mar 11, 2024

Hi, I will work on this when I will have some time on my hands. And btw, for the skynet transcriber the buffer size is ~1.2 seconds as the calculations in the participant class assume that the audio uses a 48k sampling rate everywhere, but skynet requires 16k. See

and
* The expected amount of bytes each given buffer will have. Webrtc

@miro-ku
Copy link
Author

miro-ku commented Mar 11, 2024

Hi, @rpurdel thanks!

Yeah, you're right, buffer is bigger for whisper, but still too small and processed too frequent. I've confirmed much less workload on Skynet by modifying demo to use 5 seconds buffer

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants