You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
One great thing about Icechunk is that the manifests store info about all of the chunks, making it in theory fast to know the on-disk size of very large arrays. (This is slow with vanilla Zarr because it requires listing the object store.)
Today I discovered we are not quite there yet though.
basically we implemented getsize that takes a key, so python is still listing all keys and calling this ... which is slow because python <-> rust...
it's the switching between python and rust back and forth for every key that is slow
To fully optimize this, I think we need to implement the store.getsize_prefix method in Rust world.
The text was updated successfully, but these errors were encountered:
One great thing about Icechunk is that the manifests store info about all of the chunks, making it in theory fast to know the on-disk size of very large arrays. (This is slow with vanilla Zarr because it requires listing the object store.)
Today I discovered we are not quite there yet though.
I never had the patience to let the last line finish after waiting minutes.
From @paraseba on slack
To fully optimize this, I think we need to implement the
store.getsize_prefix
method in Rust world.The text was updated successfully, but these errors were encountered: