
WSI reader improvements #218

Open · wants to merge 24 commits into master

Conversation

stergioc (Collaborator)

Added

  • slideio backend for WSI is now available and allows loading, among others, the Olympus .vsi format
  • nn.DataParallel for WSI inference now allows using multiple GPUs and larger batch sizes
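As a rough illustration of the multi-GPU point above (a hedged sketch, not the PR's actual code; `TileClassifier` is a hypothetical stand-in for the real inference model):

```python
import torch
import torch.nn as nn

# Hypothetical tile-level model standing in for the actual WSI inference model.
class TileClassifier(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 8, kernel_size=3),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
            nn.Linear(8, 2),
        )

    def forward(self, x):
        return self.net(x)

model = TileClassifier()
if torch.cuda.device_count() > 1:
    # nn.DataParallel replicates the model on each GPU and scatters the
    # batch across them, so larger batch sizes become feasible.
    model = nn.DataParallel(model)
model.eval()

with torch.no_grad():
    tiles = torch.randn(16, 3, 64, 64)  # one batch of RGB tiles
    logits = model(tiles)
print(logits.shape)  # torch.Size([16, 2])
```

On a single-GPU or CPU machine the wrapper is skipped and the model runs unchanged, so the same code path works everywhere.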

Changed

  • Removed chunking for the io.wsi reader (leads to a ~10x speedup when reading tiles from the sdata)

@stergioc stergioc changed the title Wsi reader improvements WSI reader improvements Feb 25, 2025
@quentinblampey (Collaborator) left a comment


Nice @stergioc! I added two comments (questions, actually)

@@ -79,6 +79,16 @@ def __init__(self, path: str, tilesize: int = 512):
self._writeable = False
self._erasable = False

def __contains__(self, key: str):
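For context (my own illustration, not from the PR): zarr-style stores implement Python's mapping protocol, and `key in store` dispatches to `__contains__`; without it, membership tests fall back to iteration, which a read-only store may not support efficiently. A minimal sketch with a hypothetical `TileStore`:

```python
class TileStore:
    """Minimal mapping-like store sketch (hypothetical names)."""

    def __init__(self):
        # Keys a zarr-style consumer might probe for: array metadata
        # and a chunk coordinate.
        self._keys = {".zarray", "0.0"}

    def __contains__(self, key: str) -> bool:
        # Answers `key in store` directly, with no iteration fallback.
        return key in self._keys

store = TileStore()
print(".zarray" in store)  # True
print("1.1" in store)      # False
```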

Why is this needed here?

@@ -36,9 +35,9 @@ def wsi(
scale_image = DataArray(
img[key].transpose("S", f"Y{suffix}", f"X{suffix}"),
dims=("c", "y", "x"),
).chunk(chunks)
)
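For context on the hunk above (an illustrative sketch with toy shapes, not sopa code): dropping the `.chunk(chunks)` call determines whether the resulting DataArray is a lazy dask-backed array or a plain in-memory one:

```python
import numpy as np
import xarray as xr

# Toy image; real WSI levels are far larger (illustrative shapes only).
img = xr.DataArray(
    np.zeros((3, 256, 256), dtype=np.uint8),
    dims=("c", "y", "x"),
)

# With .chunk(...): dask-backed, reads are lazy and per-chunk.
chunked = img.chunk({"c": 3, "y": 128, "x": 128})

print(type(img.data).__name__)      # ndarray (in-memory numpy)
print(type(chunked.data).__name__)  # Array (lazy dask array)
```

This is why the reviewer's question matters: downstream code that assumes `.data` is a dask array (e.g. to rechunk or compute per-chunk) behaves differently when the array is plain numpy.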

Do we still have chunks (since you removed .chunk(chunks))? Do we sometimes have images without any chunks?

@stergioc (Collaborator, Author)

While trying to answer your questions, I noticed that even though this branch passes all tests, it fails when used on normal WSIs (not the small ones I used for testing). It seems that the Image2DModel.parse function that follows throws a memory error (CMU-3.svs):

numpy._core._exceptions._ArrayMemoryError: Unable to allocate 67.0 GiB for an array with shape (3, 45402, 66000) and data type int64

I will take a closer look at this, because there is also a ~7-10x difference in reading time between sopa and the native read_region of the different backends.
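A quick back-of-the-envelope check of the 67.0 GiB figure from the traceback above (the shape and int64 dtype come from the error message; the uint8 comparison is my own, since RGB WSI tiles are typically stored as uint8):

```python
# Shape and dtype taken from the reported numpy error.
shape = (3, 45402, 66000)
itemsize_int64 = 8  # bytes per int64 element

n_bytes = itemsize_int64 * shape[0] * shape[1] * shape[2]
gib = n_bytes / 2**30
print(round(gib, 1))      # 67.0 -> matches the error message

# The same array as uint8 would need 1/8 of that.
print(round(gib / 8, 1))  # 8.4
```

The match with the reported 67.0 GiB suggests the image was promoted to int64 somewhere along the way rather than kept as uint8.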
