Add TLSH hashing of response bodies as an optional feature #680

Open
kz0ltan opened this issue Nov 30, 2023 · 2 comments
Labels
Type: Enhancement Most issues will probably ask for additions or changes.

Comments


kz0ltan commented Nov 30, 2023

Please describe your feature request:

Add TLSH hashing of response bodies so that crawled files can be compared offline.

Describe the use case of this feature:

Hashes are the simple, commonly accepted way to compare file "signatures" offline without saving the crawled files. However, dynamic pages can return different content on each request, so an exact hash changes even when the page is essentially the same. TLSH, a locality-sensitive hash, could solve this without having to interpret the files (which could be a separate, more complex feature).
To avoid slowing down the crawler, the feature could be optional and enabled with a CLI flag.
The hash is only needed in the JSON output, IMO.
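
To illustrate the idea, here is a rough sketch of what an opt-in hash field in a JSON record could look like. It is not the code from the pull request: the function names, the `with_tlsh` switch, the record layout, and the sample page are all hypothetical, and it assumes the third-party `py-tlsh` Python package for the TLSH part (it falls back to SHA-256 only if that package is missing).

```python
import hashlib
import json

try:
    import tlsh  # third-party py-tlsh package (assumption: installed separately)
except ImportError:
    tlsh = None


def body_record(url, body, with_tlsh=False):
    """Build a JSON-serialisable record for one response body (hypothetical layout)."""
    record = {"url": url, "sha256": hashlib.sha256(body).hexdigest()}
    if with_tlsh and tlsh is not None:
        digest = tlsh.hash(body)
        # TLSH needs enough input and variation; short or uniform bodies yield
        # an empty or "TNULL" digest, which is stored as null here.
        record["tlsh"] = digest if digest and digest != "TNULL" else None
    return record


def tlsh_distance(a, b):
    """Return the TLSH distance between two records, or None if not available."""
    if tlsh is None or not a.get("tlsh") or not b.get("tlsh"):
        return None
    return tlsh.diff(a["tlsh"], b["tlsh"])


# Two fetches of the same made-up dynamic page: only the timestamp differs.
template = (
    "<html><head><title>Product catalogue</title></head><body>"
    "<h1>Welcome back</h1><p>Our catalogue lists {n} items across many "
    "categories, including tools, garden furniture, paint, lighting and "
    "kitchen fittings.</p><footer>Generated at {ts}</footer></body></html>"
)
first = body_record("https://example.com/", template.format(n=1200, ts="10:15:01").encode(), with_tlsh=True)
second = body_record("https://example.com/", template.format(n=1200, ts="10:15:02").encode(), with_tlsh=True)

print(json.dumps(first))
print("sha256 equal:", first["sha256"] == second["sha256"])
print("tlsh distance:", tlsh_distance(first, second))
```

The SHA-256 values differ even though only one character of the body changed, which is the problem described above. A TLSH distance, when available, is a number where smaller means more similar, so near-identical responses can be matched offline with a threshold chosen by the user.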

@kz0ltan kz0ltan added the Type: Enhancement Most issues will probably ask for additions or changes. label Nov 30, 2023

kz0ltan commented Nov 30, 2023

The code is already written; a pull request is coming.

@kz0ltan kz0ltan mentioned this issue Nov 30, 2023

kz0ltan commented Nov 30, 2023

#681
