You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've wrote this week audio.vadwebrtc as I needed a quick way to remove from audio files segments without voice as I need to transcribe audio files with audio.whisper and that model hallucinates on audio segments containing only silences.
Would it be possible technically to allow multiple start/total_times so that these are all combined in 1 file? So that I can write something like this: av_audio_convert(file, output = "test.wav", start_time = voiced$start, total_time =voiced$end - voiced$start), generating 1 output file?
The text was updated successfully, but these errors were encountered:
I've wrote this week audio.vadwebrtc as I needed a quick way to remove from audio files segments without voice as I need to transcribe audio files with audio.whisper and that model hallucinates on audio segments containing only silences.
I looked at this chunk of code in package av:
av/src/video.c
Lines 652 to 670 in 58d7026
and it only handles one start_time / total_time.
My code looks like this to extract from an audio file only the part containing voice.
Would it be possible technically to allow multiple start/total_times so that these are all combined in 1 file? So that I can write something like this:
av_audio_convert(file, output = "test.wav", start_time = voiced$start, total_time =voiced$end - voiced$start)
, generating 1 output file?The text was updated successfully, but these errors were encountered: