improve performance of stream hashing #51

Open
deanmalmgren opened this issue Apr 17, 2014 · 1 comment

Comments

@deanmalmgren
Owner

This is already noticeably slow with the simple image-colors example. It takes a very long time to get started.

@deanmalmgren
Owner Author

It might be enough to use a command-line tool instead of streaming through Python. md5sum is pretty darn fast, even on the image-colors example. For example, md5sum hashes the 9.5 GB Wikipedia corpus in 1 minute on a VirtualBox-hosted Ubuntu VM.
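
A rough sketch of what that could look like, for comparison with a pure-Python chunked read. The function names md5_python and md5_command_line are hypothetical and are not taken from this project's code; the sketch assumes the input is a file on disk and falls back to hashlib when md5sum is not on the PATH.

```python
import hashlib
import shutil
import subprocess


def md5_python(path, chunk_size=1 << 20):
    """Hash a file in pure Python, reading it in fixed-size chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as stream:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: stream.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def md5_command_line(path):
    """Hash a file by shelling out to md5sum (hypothetical helper).

    Assumption: falling back to the pure-Python version is acceptable
    when md5sum is not installed.
    """
    if shutil.which("md5sum") is None:
        return md5_python(path)
    result = subprocess.run(
        ["md5sum", "--", path],
        capture_output=True,
        text=True,
        check=True,
    )
    # md5sum prints "<hex digest>  <filename>"; a leading backslash marks
    # an escaped filename, so strip it before taking the digest field.
    return result.stdout.lstrip("\\").split()[0]
```

Both functions should return the same hex digest for the same file, so timing them against each other on the image-colors data would show whether the subprocess route is actually worth it.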
