Skip to content

Files

Latest commit

ef90337 · Feb 27, 2024

History

History
This branch is 4 commits ahead of, 26 commits behind reazon-research/ReazonSpeech:master.

espnet-asr

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Feb 16, 2024
Feb 5, 2024
Feb 27, 2024

reazonspeech.espnet.asr

This supplies the main interface for using ReazonSpeech ESPnet models.

More information can be found at https://research.reazon.jp/projects/ReazonSpeech

Install

$ git clone https://github.com/reazon-research/ReazonSpeech
$ pip install ReazonSpeech/pkg/espnet-asr

Usage

Python interface

from reazonspeech.espnet.asr import load_model, transcribe, audio_from_path

# Load ReazonSpeech model from Hugging Face
model = load_model()

# Read a local audio file
audio = audio_from_path("speech.wav")

# Recognize speech
ret = transcribe(model, audio)

Comnand-line interface

$ reazonspeech-espnet-asr speech.wav

Use -h to show a full help.

$ reazonspeech-espnet-asr -h