Arrays dynamically generated from other arrays in a group, e.g. for remote sensing indices. #2764
benritchie
started this conversation in
Ideas
Replies: 0 comments
Hey All
I wondered whether there are any plans for, or interest in, dynamically generated arrays in a column group.
I think this could have significant use cases in geo/remote sensing data at least, where it's common to want easy access to indices derived from other columns. An example would be NDVI as a dynamic column, defined as (near-IR - red) / (near-IR + red), where near-IR and red are other arrays in the group.
Although it would be possible to calculate this dynamically on the client side, I think there are lots of cases where it's desirable to have this data as a separate column in the Zarr array, e.g. because the Zarr stores are tied to a tiling engine for visualisation, or for ease of providing ML inputs. For scale: we are currently storing about 0.3 petabytes of Zarr arrays, and about 20% of that is bands (arrays) such as these.
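To make the NDVI example concrete, here is a minimal client-side sketch using NumPy. The names `ndvi` and `DerivedArray` are hypothetical and only illustrate the idea; they are not part of Zarr or any existing extension. The wrapper assumes the source arrays share a shape and support NumPy-style slicing.

```python
import numpy as np


def ndvi(nir, red):
    """NDVI = (near-IR - red) / (near-IR + red), NaN where the sum is zero."""
    nir = np.asarray(nir, dtype="float64")
    red = np.asarray(red, dtype="float64")
    denom = nir + red
    out = np.full(denom.shape, np.nan)
    # Only divide where the denominator is non-zero; other cells stay NaN.
    np.divide(nir - red, denom, out=out, where=denom != 0)
    return out


class DerivedArray:
    """Hypothetical read-only view computed on demand from source arrays."""

    def __init__(self, func, *sources):
        self.func = func
        self.sources = sources
        self.shape = sources[0].shape  # assumption: all sources share a shape

    def __getitem__(self, key):
        # Slice each source first, then compute only the requested region.
        return self.func(*(src[key] for src in self.sources))


# In-memory stand-ins for the near-IR and red arrays of a group.
nir_band = np.array([[0.5, 0.0], [0.8, 0.2]])
red_band = np.array([[0.1, 0.0], [0.2, 0.2]])

ndvi_view = DerivedArray(ndvi, nir_band, red_band)
first_row = ndvi_view[0, :]
```

Nothing is stored for the derived band here: each read slices the sources and computes only the requested region, which is the behaviour a dynamically generated array in the group would provide to readers.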
I had a look to see if there was a way to implement this with the current extension points, but I couldn't see one. I suspect the group sharding extension might bring this closer to feasibility though, e.g. something along the lines of the following:
Anyway, I'm mostly just wondering if anything like this is a possibility. I'd be happy to help work on it if it is.