Skip to content

Provide a topic and it will create a narrated video: Generate video script -> generate caption images -> narrate the video script -> bind it together into a video mp4 (Pretty cool if you ask me). This is just a PoC.

License

Notifications You must be signed in to change notification settings

eldare/Simple_Ai_Video_Generator

Repository files navigation

Simple Ai Video Generator (PoC)

^^ The amazing Dalle-2 API /s

Output Example:

A demo output video generated by this project can be found here:

How to run:

  • Install ffmpeg: https://ffmpeg.org/download.html (or, you can probably install it as a package: brew install ffmpeg, apt install ffmpeg, etc.)
  • Install Pip Environemnt: pipenv install && pipenv shell
  • Install Gemfile: gem install bundler && bundle install
  • Set OpenAI API Key: export OPENAI_API_KEY=<KEY>
  • Set Elevenlabs API Key: export ELEVEN_API_KEY=<KEY>
  • In ./src/main.py (bottom): set the channel name, the desired topic, voice name, and the destination dir.
  • Run ./src/main.py

PoC development:

  • GPT-turbo to generate video script and captions
  • Generate images with Dalle2 (or something much better if possible)
  • Elevenlabs api to generate narration
  • Get length of mp3 speech
  • Calculate time of each slide (equal times for now)
  • Generate output mp4 video
  • Background music
    • TBD

Known issues:

  • QuickTime Player on MacOS plays the audio incorrectly after a few seconds of playback. There is no problem with the video and it's audio, it's an issue with this specific player. The video can be uploaded without issues to YouTube and played with other players.

Contribution:

Feel free to fork, suggest ideas, report issues, and give general constructive feedback.

^(;,;)^

About

Provide a topic and it will create a narrated video: Generate video script -> generate caption images -> narrate the video script -> bind it together into a video mp4 (Pretty cool if you ask me). This is just a PoC.

Topics

Resources

License

Stars

Watchers

Forks