Add support for other transcription services #90

MtCelesteMa · 2024-05-11T08:42:14Z

Description of proposed feature

Currently, manim-voiceover only supports Whisper as a transcription service, and it is hard coded for all SpeechService backends. I propose that the manim_voiceover.services module be modified to have flexibility in which transcription backend is being used.

How can the new feature be used?

Not only will this give users a choice in which transcription service to use, but it will also make it much easier for users to add transcription services that are not yet covered.

Additional comments

Whisper is no longer the best transcription service available, being beaten by other services such as AssemblyAI, which also supports word-level timestamps among other features.

The text was updated successfully, but these errors were encountered:

MtCelesteMa added the enhancement New feature or request label May 11, 2024

MtCelesteMa assigned osolmaz May 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for other transcription services #90

Add support for other transcription services #90

MtCelesteMa commented May 11, 2024

Add support for other transcription services #90

Add support for other transcription services #90

Comments

MtCelesteMa commented May 11, 2024

Description of proposed feature

How can the new feature be used?

Additional comments