Results from encoding with 44kHz model at 8kbps does not yield an 8kbps encoding #73

Open
fred5825 opened this issue Jun 14, 2024 · 3 comments

Comments

@fred5825

Hello,

I am running the example script with the 44kHz model at an 8kbps bitrate, but when I manually calculate the bitrate it ends up being 16kbps. I see the same issue when running the 16kHz model at an 8kbps bitrate: I calculate the actual bitrate to be ~20kbps. The audio samples I used were sampled at 44.1kHz and 16kHz respectively, and both are mono. Curious whether anyone else has run into this issue?
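For anyone comparing numbers, here is a minimal sketch of one way to do the manual calculation: compressed file size in bits divided by the audio duration. The function names and file paths are hypothetical and not part of the library, and the duration helper assumes a plain PCM .wav file that Python's standard `wave` module can read.

```python
import os
import wave


def bitrate_kbps(size_bytes: int, duration_s: float) -> float:
    """Bitrate in kbps implied by a payload size and an audio duration."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return size_bytes * 8 / duration_s / 1000.0


def wav_duration_s(wav_path: str) -> float:
    """Duration of a PCM .wav file in seconds (frames / sample rate)."""
    with wave.open(wav_path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def measured_bitrate_kbps(compressed_path: str, wav_path: str) -> float:
    """Bitrate of the compressed file relative to the source audio duration.

    Both paths are placeholders: pass the file written by the compressor
    and the original .wav it was produced from.
    """
    size_bytes = os.path.getsize(compressed_path)
    return bitrate_kbps(size_bytes, wav_duration_s(wav_path))
```

Note that this measures the whole file on disk, so any header or metadata in the compressed file counts toward the result, which matters more for short clips.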

@liangjindeamo-yuer

I have the same problem and the results are the same as yours! The actual bitrate is ~20kbps, what's wrong with it?

@listener17

@fred5825 "44kHz model at a 8kbps bitrate but when I manually calculate the bitrate it ends up being 16kbps. " --- maybe you're using the 16 kbps model?

@KemalAltwlkany

Hi, I would like to report unexpected results as well in terms of bitrate.

My colleague and I used the examples provided on the main GitHub page. We applied the compress method to a 9.62-second audio file in .wav format. The resulting bitrates are:

  • ~16kbps for the 44.1kHz model
  • ~57kbps for the 24kHz model.

In general, the library is poorly documented. There is not enough explanation of the values returned by the encode method (z, codes and latents), so it is unclear which of them should be used for storing compressed audio. We inspected the source code and concluded that the compress method stores the codes variable as uint16 in a DACFile. When decompress is called, codes is used to recreate z, from which the audio is reconstructed.
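To illustrate why the storage format could matter, here is a rough sketch that compares a nominal bitrate (each code costs log2(codebook size) bits) with the bitrate obtained when every code is written as a 16-bit integer. The function names are hypothetical, and the number of codebooks, codebook size and frame rate in the usage lines are illustrative assumptions, not values confirmed in this thread.

```python
import math


def nominal_bitrate_kbps(n_codebooks: int, codebook_size: int,
                         frame_rate_hz: float) -> float:
    """Bitrate if each code takes exactly log2(codebook_size) bits."""
    bits_per_code = math.log2(codebook_size)
    return n_codebooks * bits_per_code * frame_rate_hz / 1000.0


def stored_bitrate_kbps(n_codebooks: int, frame_rate_hz: float,
                        bits_per_stored_code: int = 16) -> float:
    """Bitrate if each code is stored in a fixed-width integer (uint16 here)."""
    return n_codebooks * bits_per_stored_code * frame_rate_hz / 1000.0


# Illustrative values only (assumptions, not taken from the library):
# 9 codebooks, 1024 entries per codebook, 86 code frames per second.
nominal = nominal_bitrate_kbps(n_codebooks=9, codebook_size=1024, frame_rate_hz=86)
stored = stored_bitrate_kbps(n_codebooks=9, frame_rate_hz=86)
print(f"nominal: {nominal:.2f} kbps, stored as uint16: {stored:.2f} kbps")
```

Under these assumed numbers the uint16 storage is larger than the nominal figure by a factor of 16/10. That may account for part of the gap reported above, though file metadata or padding could contribute as well.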

4 participants