How to accelerate the purge of non-current objects beyond the lifecycle or accelerate MinIO Scanner #17675
-
I have a MinIO cluster, but its scanner has been running very slowly. As a result, there are a large number of non-current objects in the cluster that exceed the non-current lifecycle but have not been purged, these non-current objects will take up my space so that I want to purge them. To address this, I made adjustments using the mc client. Originally, the settings were delay: 10, max_wait: 15, cycle: 1. I adjusted them to delay: 6, max_wait: 10, cycle: 1. However, when I checked the Last full scan time using mc admin scanner status, it showed 150d0h15m. I don't feel any improvement after changing the settings. Is there any way to make the MinIO scanner work faster or to purge non-current objects beyond the lifecycle more promptly? And what will happen with MinIO scanner if I execute mc admin service restart ? The version of MinIO is RELEASE.2022-12-12T19-27-27Z, I used 48 nodes, each node have 10 drives, each drive has 7TiB, EC:6. Belows are the result of mc admin scanner status : With setting delay: 6, max_wait: 10, cycle: 1: |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 7 replies
-
mc admin config set ALIAS scanner speed=default try these - fastest, fast, default, slow, slowest no restart required. |
Beta Was this translation helpful? Give feedback.
-
Please upgrade to latest - if you looking for professional-level help - your cluster needs to subscribe itself at https://min.io/pricing |
Beta Was this translation helpful? Give feedback.
-
Scanner is totally a bad design, I think the full scanner of bucket should be just a of admin command to use for migration or similar to fix something, not a regular running mechanism. Minio should build a metadata database, use some kvdb or tsdb, and use just a simple query to fetch the file list for lifecycle etc. Scan hundreds of thousands or millions of files is horrible. |
Beta Was this translation helpful? Give feedback.
Please upgrade to latest - if you looking for professional-level help - your cluster needs to subscribe itself at https://min.io/pricing