Reconsider `FieldSet.from_xarray_dataset()` #1940

VeckoTheGecko · 2025-03-18T16:35:06Z

FieldSet.from_xarray_dataset() fundamentally assumes that a single dataset contains all field information. That may not hold, as the field information can be scattered across multiple files that have different dimensions.

I'm not sure how useful this abstraction is as a method, given this difference. It may be worth removing it outright, or just clearly documenting that it's limited in scope.

Removal can be done at a later stage (removing this method now would just interfere with the test cases that currently use it).

fluidnumerics-joe · 2025-03-18T16:59:31Z

With us moving towards xarray/uxarray adoption, this probably won't be needed.

That being said, assuming that a single xarray.Dataset contains all field information may not be such a bad assumption. As I understand it, an xarray.Dataset comprises one or more xarray.DataArray objects, with each xarray.DataArray representing a field with its coordinates and dimensions. open_mfdataset can load data arrays from multiple files and combine them into a single dataset.
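To make the Dataset/DataArray relationship concrete, here is a minimal in-memory sketch. The grid, the values, and the names `ds_u` and `ds_v` are made up for illustration; they stand in for datasets opened from separate U and V files, and `xr.merge` is used here in place of `open_mfdataset` so that nothing has to be read from disk.

```python
import numpy as np
import xarray as xr

# Hypothetical grid and values, purely for illustration.
lon = np.linspace(0.0, 3.0, 4)
lat = np.linspace(-1.0, 1.0, 3)
coords = {"lat": lat, "lon": lon}

# Stand-ins for datasets opened from separate U and V files.
ds_u = xr.Dataset({"U": (("lat", "lon"), np.ones((3, 4)))}, coords=coords)
ds_v = xr.Dataset({"V": (("lat", "lon"), np.zeros((3, 4)))}, coords=coords)

# Merge the two single-variable datasets into one dataset on the shared grid.
ds = xr.merge([ds_u, ds_v])

# Each variable in the merged dataset is a DataArray with its own dims and coords.
print(sorted(ds.data_vars))
print(type(ds["U"]).__name__, ds["U"].dims)
```

Whether this carries over to fields that live on different grids or use different dimension names is exactly the open question in this issue, so treat it as a sketch of the simple case only.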

VeckoTheGecko · 2025-03-19T07:44:56Z

I see. I hadn't used open_mfdataset for combining along non-time dimensions, so I'm not entirely sure how it would work when U and V are stored in separate files. I think you're right; we can investigate what this would look like in v4.

fluidnumerics-joe · 2025-03-19T13:51:00Z

The documentation suggests that open_mfdataset (by default) will combine the datasets from all the files into a single dataset using combine_by_coords. This bit from the combine_by_coords docs sums up the behavior nicely, I think:

Attempt to auto-magically combine the given datasets (or data arrays) into one by using dimension coordinates.

This function attempts to combine a group of datasets along any number of dimensions into a single entity by inspecting coords and metadata and using a combination of concat and merge.

Will attempt to order the datasets such that the values in their dimension coordinates are monotonic along all dimensions.
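As a rough illustration of the behavior quoted above, the sketch below calls `xr.combine_by_coords` directly on small in-memory datasets. The helper `make_piece` and all values are hypothetical; the four pieces stand in for files that hold U and V separately, each split into two time chunks and passed in out of order.

```python
import numpy as np
import xarray as xr


def make_piece(name, times):
    """Build a small dataset holding one variable over the given time steps."""
    x = np.arange(3.0)
    # Fill each piece with its first time value so the ordering is visible afterwards.
    data = np.full((len(times), x.size), float(times[0]))
    return xr.Dataset({name: (("time", "x"), data)}, coords={"time": times, "x": x})


# Hypothetical stand-ins for four files, deliberately listed out of order.
pieces = [
    make_piece("V", [2, 3]),
    make_piece("U", [0, 1]),
    make_piece("V", [0, 1]),
    make_piece("U", [2, 3]),
]

# Concatenate along "time" (ordered by coordinate values) and merge U with V.
combined = xr.combine_by_coords(pieces)

print(sorted(combined.data_vars))
print(combined["time"].values)
```

The input order does not matter here because the pieces are ordered by their `time` coordinate values, which is the "monotonic along all dimensions" part of the quoted docs.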

In the example I wrote (under the "Re-working our example" header), the files that @danliba provided have u, v, and w stored in separate files, and this works just fine:

import uxarray as ux

# Grid (mesh) file, plus one data file per velocity component
grid_path = "./data/channel_lizarbe/fesom.mesh.diag.nc"
data_path = [
    "./data/channel_lizarbe/u.fesom.2005_cut.nc",
    "./data/channel_lizarbe/v.fesom.2005_cut.nc",
    "./data/channel_lizarbe/w.fesom.2005_cut.nc",
]

# Combine u, v, and w into a single dataset on the shared grid
uxds = ux.open_mfdataset(grid_path, data_path)

VeckoTheGecko mentioned this issue Mar 18, 2025

Create an inventory of features to drop in v4 #1844

github-project-automation bot added this to Parcels development Mar 18, 2025

github-project-automation bot moved this to Backlog in Parcels development Mar 18, 2025

VeckoTheGecko changed the title ~~FieldSet.from_xarray_dataset()~~ Reconsider FieldSet.from_xarray_dataset() Mar 18, 2025