Skip to content

Dockerfile for speech to text (STT) based on Kaldi ASR (http://kaldi-asr.org/) and 'chain' models (type of DNN-HMM model) from api.ai

Notifications You must be signed in to change notification settings

joshuachris2001/docker-stt

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

FIX ERROR: This script does not work on Python 2.7 The minimum supported Python version is 3.7. Please use https://bootstrap.pypa.io/pip/2.7/get-pip.py instead.

Speech to text service (STT)

Description

Dockerfile for speech to text (STT) based on Kaldi ASR (http://kaldi-asr.org/) Kaldi's 'chain' models (type of DNN-HMM model) used Model released by api.ai team (https://github.com/api-ai/api-ai-english-asr-model)

Usage

In the command line run following commands

docker build -t stt .
docker run -it stt

file in.wav will be processed and output should contain following text:

/opt/in/in.wav HELLO THIS IS SPEECH TO TEXT RECOGNITION FOR JOKER PROJECT 

output also contains a lot of debug messages (ignore it).

Provide your own audio (wav) file

Record your file first with following command (works under Linux)

mkdir in
arecord -d 10 -fS16_LE -r 16000 -c 1 in/in.wav 

or you can prepare in.wav in any other software. File format: 16 bit, mono 16000 Hz

now you can run docker container:

docker run -it -v `pwd`/in:/opt/in stt

(c) Abylay Ospan [email protected], 2017 https://jokersys.com

About

Dockerfile for speech to text (STT) based on Kaldi ASR (http://kaldi-asr.org/) and 'chain' models (type of DNN-HMM model) from api.ai

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Dockerfile 94.3%
  • Shell 5.7%