A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Updated Dec 24, 2024 - Python
A ready-to-use repo for speaker diarization using clustering-based segmentation with spectral clustering.
A simple and private transcription tool able to segment speakers and convert audio to text.