Skip to content
Discussion options

You must be logged in to vote

Hi,

If I understand you correctly you can just use the model output as-is and / or write you own decoder.

The shape would be something like batch * frames * tokens. A list of tokens is in the json.

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by snakers4
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #22 on December 09, 2020 07:06.