Speaker Diarization pipeline.get_segmentations produces integer ascending start/ends instead of something useful #1685
Comments
Okay, I dug through the code and see that the actual start/ends are created later, in to_diarization or to_annotation. However, diarizing the new audio file this way, using existing clusters (with the same speaker, me), results in totally different (and very bad) annotations compared to just running the pretrained pipeline on the file directly. Running the pipeline by itself produces this set of segments:
Whereas the method I described, with existing clusters, gives me:
Tested versions
3.1
System information
macOS 13.6 - pyannote 3.1 - M2 Air
Issue description
I'm running:
```python
import os
import torch
from pyannote.audio import Pipeline

self.pipeline = Pipeline.from_pretrained(
    "pyannote/speaker-diarization-3.1", use_auth_token=os.environ["HF_API_KEY"]
)
segmentations = self.pipeline.get_segmentations({'waveform': torch.from_numpy(waveform), 'sample_rate': sample_rate})
splits = [(segment, data) for segment, data in segmentations]
```
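One possible reading of the integer start/ends: if the object returned by get_segmentations is a sliding window over fixed-length chunks, then iterating it yields one window per chunk, not one per speaker turn. The sketch below is plain NumPy and is not pyannote's code. The 10 s chunk duration, the 1 s step, the 0.5 threshold, and the helper names `chunk_windows` and `active_regions` are all assumptions made up for illustration.

```python
import numpy as np


def chunk_windows(num_chunks, duration=10.0, step=1.0, start=0.0):
    """(start, end) of each chunk in a sliding window (assumed 10 s long, 1 s apart)."""
    return [(start + i * step, start + i * step + duration) for i in range(num_chunks)]


def active_regions(frame_activity, chunk_start, chunk_duration, threshold=0.5):
    """Turn one speaker's per-frame scores inside one chunk into (start, end) times in seconds."""
    active = np.asarray(frame_activity) > threshold
    frame_step = chunk_duration / len(active)
    # Pad with False on both sides so every active run has an onset and an offset.
    padded = np.concatenate(([False], active, [False]))
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    onsets, offsets = changes[::2], changes[1::2]
    return [
        (float(chunk_start + on * frame_step), float(chunk_start + off * frame_step))
        for on, off in zip(onsets, offsets)
    ]


# Chunk-level windows: integer, ascending, overlapping.
windows = chunk_windows(3)

# Frame-level scores inside the chunk that starts at 3 s (hypothetical values).
regions = active_regions([0, 0, 1, 1, 1, 0, 0, 1, 1, 0], chunk_start=3.0, chunk_duration=10.0)
```

Under these assumptions, `windows` only describes where each chunk sits in the file. Speaker-turn boundaries have to be recovered from the per-frame data inside each chunk, which would match the observation that the real start/ends are only created later.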