Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
vitali-fedulov committed Feb 6, 2024
1 parent afce02b commit ad8a6b3
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

This is a fast and RAM-friendly hash-table-based image comparison package **for large image collections** (thousands and more). Resized and near-duplicate images can be found with it.

When numBuckets parameter is low (~4), the package is a **rough pre-filtering first step**. Then the **second precise step** is needed with [images4](https://github.com/vitali-fedulov/images4) on the image set produced in the first step. This 2 step sequence (imagehash > images4) is necessary, because direct one-to-all comparison with images4 might be slow for very large image collections. For small image sets it is OK to skip the first step altogether.
When numBuckets parameter is low (~4), the package is a **rough pre-filtering first step**. Then the **second precise step** is needed with [images4](https://github.com/vitali-fedulov/images4) on the image set produced in the first step. This 2 step sequence (imagehash > images4) is necessary, because direct one-to-all comparison with images4 might be slow for very large image collections. For small image sets it is easier to skip the first step altogether.

When numBuckets is very high (~200), be sure to do a few tests, because, as in the example below, only 10 dimensions (pixels in Y channel) are used from the total 11x11*3=363 pixel values in the icon. This could under-represent some images, because bucket width is very small for high numBuckets.

Expand Down

0 comments on commit ad8a6b3

Please sign in to comment.