-
I'm excited to try the OWSM 3.1 model, but the snippet provided on HuggingFace under "Use in ESPnet" does not work for me: from espnet2.bin.asr_inference import Speech2Text
model = Speech2Text.from_pretrained(
"espnet/owsm_v3.1_ebf"
) I get the following error: ---------------------------------------------------------------------------
TypeError Traceback (most recent call last)
Cell In[1], line 3
1 from espnet2.bin.asr_inference import Speech2Text
----> 3 model = Speech2Text.from_pretrained(
4 "espnet/owsm_v3.1_ebf"
5 )
File ~/mambaforge/envs/espnet/lib/python3.10/site-packages/espnet2/bin/asr_inference.py:679, in Speech2Text.from_pretrained(model_tag, **kwargs)
676 d = ModelDownloader()
677 kwargs.update(**d.download_and_unpack(model_tag))
--> 679 return Speech2Text(**kwargs)
TypeError: Speech2Text.__init__() got an unexpected keyword argument 's2t_train_config' I tried What is the correct code to load the model and run transcription? Thanks! |
Beta Was this translation helpful? Give feedback.
Answered by
sw005320
Mar 28, 2024
Replies: 2 comments
-
It should be @pyf98, we encountered this misuse in two cases.
|
Beta Was this translation helpful? Give feedback.
0 replies
Answer selected by
cifkao
-
yeah, I see. We need to update "Use in ESPnet". I think it is defined in Hugging Face not ESPnet? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
It should be
espnet2.bin.s2t_inference
notespnet2.bin.asr_inference
@pyf98, we encountered this misuse in two cases.