Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Why the OnlineASRProcessor.commited is always incremented? #19

Open
bianxg opened this issue Oct 21, 2023 · 2 comments
Open

Why the OnlineASRProcessor.commited is always incremented? #19

bianxg opened this issue Oct 21, 2023 · 2 comments

Comments

@bianxg
Copy link

bianxg commented Oct 21, 2023

Why the OnlineASRProcessor.commited is always incremented (it includes all commited transcripts from the beginning)?
As time passes, the "commited" grows large. Should we truncate it ?

@Gldkslfmsd
Copy link
Collaborator

Gldkslfmsd commented Oct 21, 2023

Hi,
thanks for feedback. This is an unresolved corner case. Yes, you can truncate it after this line

self.commited.extend(o)

, but there must remain 200 last characters. Or other size, I haven't experiment with the 200 here:

while p and l < 200: # 200 characters prompt size
, it can be tuned/improved.

Does it the current version without truncating harm the performance? Do you have memory issues? Or is it just annoying in the log?
After how long audio/how many words? Usually nobody runs one audio processing so long so that's an issue.

@bianxg
Copy link
Author

bianxg commented Oct 23, 2023

Thanks. I just review the code and think it will be harmful to performance and memory. I think of its application in video conference.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants