Word-level timestamps are very inaccurate #294

Closed
a-rogalska opened this issue Jun 12, 2023 · 2 comments

Comments

@a-rogalska

I'm using the large-v2 model to transcribe multilingual audio (much of it in German). In many cases, usually at the beginning of a segment, the word-level timestamps are incorrect: the start time is later than the end time. I know that whisper-timestamped gives fairly accurate results, but I would like to use faster-whisper rather than the original whisper implementation.

Is there a way to improve timestamp accuracy here?

image
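
For anyone trying to reproduce this, here is a minimal sketch of how the inverted timestamps could be detected. The `Word` dataclass is only a stand-in that mirrors the `start`, `end`, and `word` fields faster-whisper returns for each word. `find_inverted_words` and `transcribe_and_check` are hypothetical helper names, and the audio path is whatever file you pass in. It assumes faster-whisper is installed and that `word_timestamps=True` is set in `transcribe`.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class Word:
    # Stand-in for the per-word objects faster-whisper returns;
    # only the fields used below are modelled here.
    start: float
    end: float
    word: str


def find_inverted_words(words: Iterable[Word]) -> List[Word]:
    """Return the words whose start time is later than their end time."""
    return [w for w in words if w.start > w.end]


def transcribe_and_check(audio_path: str, model_size: str = "large-v2") -> List[Word]:
    """Transcribe with word timestamps and collect the inverted words.

    faster-whisper is imported lazily so find_inverted_words can be
    used (and tested) without the library or a model download.
    """
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size)
    segments, _info = model.transcribe(audio_path, word_timestamps=True)

    inverted = []
    for segment in segments:
        # segment.words is only populated when word_timestamps=True
        inverted.extend(find_inverted_words(segment.words or []))
    return inverted
```

Calling `transcribe_and_check` on one of the affected files should list the words with `start > end`, which makes it easier to compare results before and after trying a fix.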

@Purfview
Contributor

Purfview commented Jun 12, 2023

Try PR #226; it helps with word timestamps.

@a-rogalska
Author

It helped, thanks.


2 participants