Skip to content

A Windows desktop application that can generate subtitles, translations, and summaries for videos in 8 languages using API and SDK from Tencent, Alibaba, and Baidu. You can use it for generating bilingual transcripts for videos and summarising the key points from the transcript using LexRank.

License

Notifications You must be signed in to change notification settings

WideSu/lectureNoteAssistant

Repository files navigation

lectureNoteAssistant

A Windows desktop application that can generate subtitles, translations, and summaries for videos in 8 languages using API and SDK from Tencent, Alibaba, and Baidu. You can use it for generating bilingual transcripts for videos and summarising the key points from the transcript using the PageRank algorithm: LexRank.

image

How to use it?

Just download the exe file, and run it on your windows computer.

Supported video types

Supports video and audio files in common formats, including:

  • For videos, we support: .mp4 , .mpeg , .mkv , .wmv , .avi , .m4v , .mov , .flv , .rmvb , .3gp , .f4v .
  • For audio: .mp3 , .wav , .aac , .wma , .flac , .m4a formats.

Output files

It can generate 3 types Support subtitle files including: SRT file, ordinary text, lecture summary simultaneously or seperately according to user's needs. And with bilingual translation between 10 languages including Chinese, English, Japanese, Korean, French, German, Spanish, Russian, Italian, and Thai.

Services and SDKs used:

  • Baidu and tencent Translation SDK
  • go tldr for auto-summary
  • aliyun-cloud-sdk-go,aliyun-oss-go-sdk,tencentcloud-sdk-go for storing the audio files and generated transcripts
  • the intelligent voice interactive service on Aliyun

Demo

  • Firstly, upload the video or audio on your computer

  • Secondly, submit it. The software will start to generate transcripts for you

System architecture

This system uses the go walk library for development, and main.go contains the code and main logic of the main interface of the program. It calls the relevant code files in the app package to perform corresponding operations. Interface logic code, data object separation. Basically similar to the MVC pattern. It is divided into presentation layer, business logic layer and data access layer. The presentation layer is used to interact with the user, and then calls the functions of the corresponding modules in the app package to perform business operations, and the corresponding modules of the business operations then call the data layer functions to operate on the data.

image

About

A Windows desktop application that can generate subtitles, translations, and summaries for videos in 8 languages using API and SDK from Tencent, Alibaba, and Baidu. You can use it for generating bilingual transcripts for videos and summarising the key points from the transcript using LexRank.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages