How to create audio profiles for speakers for re-identification? #1766

hmehdi515 · 2024-05-28T14:34:03Z

❓ Questions and Help

Any way to create voice profiles?

What is your question?

Is there a way to store and re-identify voice profiles (similar to BOT-Sort for classifying and re-identifying images) by extracting embeddings that characterize the speaker's voice?
Example:
Give model an audio file with containing multiple voices and use speech diarization for segmentation.
Store the embeddings in a structure database for future comparison and re-identification.
Compare new embeddings against stored ones to identify if a speaker has been previously encountered.

What's your environment?

OS (e.g., Linux): Windows
FunASR Version (e.g., 1.0.0): 1.0.5

hmehdi515 added the question Further information is requested label May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to create audio profiles for speakers for re-identification? #1766

How to create audio profiles for speakers for re-identification? #1766

hmehdi515 commented May 28, 2024 •

edited

Loading

How to create audio profiles for speakers for re-identification? #1766

How to create audio profiles for speakers for re-identification? #1766

Comments

hmehdi515 commented May 28, 2024 • edited Loading

❓ Questions and Help

What is your question?

What's your environment?

hmehdi515 commented May 28, 2024 •

edited

Loading