Restormer Implementation #8312

phisanti · 2025-01-23T14:51:24Z

Fixes # .

Description

This PR implements the Restormer architecture for high-resolution image restoration in MONAI following the discussion in issue #8261. The implementation supports both 2D and 3D images using MONAI's convolution as the base. Key additions include:

Downsample class for efficient downsampling operations
pixel_unshuffle operation complementing existing pixel_shuffle
Channel Attention Block (CABlock) with FeedForward layer
Multi-DConv Head Transposed Self-Attention (MDTA)
OverlapPatchEmbed class
Comprehensive unit tests for all new components

The implementation follows MONAI's coding patterns and includes performance validations against native PyTorch operations where applicable.

Types of changes

Non-breaking change (fix or new feature that would not break existing functionality).
Breaking change (fix or new feature that would cause existing functionality to change).
New tests added to cover the changes.
Integration tests passed locally by running ./runtests.sh -f -u --net --coverage.
Quick tests passed locally by running ./runtests.sh --quick --unittests --disttests.
In-line docstrings updated.
Documentation updated, tested make html command in the docs/ folder.

…nsample class alias

…pass ./runtests.sh -f -u --net --coverage

for more information, see https://pre-commit.ci

ericspod

Looks good overall but I had a few inline comments, and we should have full docstrings everywhere appropriate. For any classes meant for general purpose use (ie. not just by Restormer) please ensure they have docstring descriptions for the arguments (at the very least for constructor args). Thanks!

ericspod · 2025-01-24T12:18:35Z

monai/networks/utils.py

+    See: Aitken et al., 2017, "Checkerboard artifact free sub-pixel convolution".
+
+    Args:
+        x: Input tensor


Here we should specifically state that x has shape BCHW[D].

ericspod · 2025-01-24T12:22:03Z

monai/networks/utils.py

+
+    if any(d % factor != 0 for d in input_size[2:]):
+        raise ValueError(
+            f"All spatial dimensions must be divisible by factor {factor}. " f"Got spatial dimensions: {input_size[2:]}"


Suggested change

f"All spatial dimensions must be divisible by factor {factor}. " f"Got spatial dimensions: {input_size[2:]}"

f"All spatial dimensions must be divisible by {factor}, spatial shape is: {input_size[2:]}"

Maybe a little shorter?

ericspod · 2025-01-24T12:36:58Z

monai/networks/blocks/downsample.py

+            kernel_size_ = ensure_tuple_rep(kernel_size, spatial_dims)
+            padding = tuple((k - 1) // 2 for k in kernel_size_)
+
+        if down_mode == "conv":


Suggested change

if down_mode == "conv":

if down_mode == DownsampleMode.CONV:

ericspod · 2025-01-24T12:37:20Z

monai/networks/blocks/downsample.py

+                    bias=bias,
+                ),
+            )
+        elif down_mode == "convgroup":


Suggested change

elif down_mode == "convgroup":

elif down_mode == DownsampleMode.CONVGROUP:

ericspod · 2025-01-24T12:58:07Z

monai/networks/blocks/downsample.py

+            if post_conv:
+                self.add_module("postconv", post_conv)
+
+        elif down_mode == "pixelunshuffle":


Suggested change

elif down_mode == "pixelunshuffle":

elif down_mode == DownsampleMode.PIXELSHUFFLE:

done, but I used DownsampleMode.PIXELUNSHUFFLE as in downsampling the restormer uses pixel_unshuffling while pixel_shuffling is reserved for upsampling.

ericspod · 2025-01-24T13:09:24Z

monai/networks/blocks/cablock.py

+    """Multi-DConv Head Transposed Self-Attention (MDTA): Differs from standard self-attention
+    by operating on feature channels instead of spatial dimensions. Incorporates depth-wise
+    convolutions for local mixing before attention, achieving linear complexity vs quadratic
+    in vanilla attention. Based on SW Zamir, et al., 2022 <https://arxiv.org/abs/2111.09881>"""
+


We should have a full docstring here describing the arguments for the constructor, and in the previous class.

ericspod · 2025-01-24T13:25:53Z

monai/networks/nets/restormer.py

+class OverlapPatchEmbed(nn.Module):
+    """Initial feature extraction using overlapped convolutions.
+    Unlike standard patch embeddings that use non-overlapping patches,
+    this approach maintains spatial continuity through 3x3 convolutions."""
+
+    def __init__(self, spatial_dims: int, in_c: int = 3, embed_dim: int = 48, bias: bool = False):
+        super().__init__()
+        self.proj = Convolution(
+            spatial_dims=spatial_dims,
+            in_channels=in_c,
+            out_channels=embed_dim,
+            kernel_size=3,
+            strides=1,
+            padding=1,
+            bias=bias,
+            conv_only=True,
+        )
+
+    def forward(self, x: torch.Tensor) -> torch.Tensor:
+        return self.proj(x)


Suggested change

class OverlapPatchEmbed(nn.Module):

"""Initial feature extraction using overlapped convolutions.

Unlike standard patch embeddings that use non-overlapping patches,

this approach maintains spatial continuity through 3x3 convolutions."""

def __init__(self, spatial_dims: int, in_c: int = 3, embed_dim: int = 48, bias: bool = False):

super().__init__()

self.proj = Convolution(

spatial_dims=spatial_dims,

in_channels=in_c,

out_channels=embed_dim,

kernel_size=3,

strides=1,

padding=1,

bias=bias,

conv_only=True,

)

def forward(self, x: torch.Tensor) -> torch.Tensor:

return self.proj(x)

class OverlapPatchEmbed(Convolution):

"""

Initial feature extraction using overlapped convolutions. Unlike standard patch embeddings

that use non-overlapping patches, this approach maintains spatial continuity through 3x3 convolutions.

"""

def __init__(self, spatial_dims: int, in_c: int = 3, embed_dim: int = 48, bias: bool = False):

super().__init__(

spatial_dims=spatial_dims,

in_channels=in_c,

out_channels=embed_dim,

kernel_size=3,

strides=1,

padding=1,

bias=bias,

conv_only=True,

)

Would it work to inherit directly from Convolution?

Works! very elegant suggestion btw!

aylward

Thank you for this outstanding contribution!

aylward · 2025-02-02T18:35:59Z

monai/networks/blocks/downsample.py

+        return x
+
+
+Downsample = DownSample


@ericspod - do we normally provide alternative capitalizations to functions? "Downsample" is the generally accepted term (vs "Down Sample" which is less common).

I suggest using "Downsample" throughout, unless we offer alternative usage elsewhere that I haven't encountered. IDE auto-complete can help folks get the right capitalization.

Looking at your enums, you use "Downsample" which confirms (in my mind) that we should be using "Downsample" throughout.

We have had this mechanism for other classes so other capitalisation could be used in scripts, such as "Transformd" and "TransformD". Here I'd say we don't need it though.

Here, I mirrored the naming style of the Upsample class. Happy either way to remove it or keep it:

monai/networks/blocks/upsample.py

Upsample = UpSample Subpixelupsample = SubpixelUpSample = SubpixelUpsample

aylward · 2025-02-02T18:36:58Z

monai/networks/nets/restormer.py

+    Unlike standard patch embeddings that use non-overlapping patches,
+    this approach maintains spatial continuity through 3x3 convolutions."""
+
+    def __init__(self, spatial_dims: int, in_c: int = 3, embed_dim: int = 48, bias: bool = False):


Spell-out in_c to in_channels

aylward · 2025-02-02T18:38:09Z

monai/networks/nets/restormer.py

+    def __init__(
+        self,
+        spatial_dims=2,
+        inp_channels=3,


in_channels

aylward · 2025-02-02T18:41:14Z

monai/networks/nets/restormer.py

+        num_refinement_blocks=4,
+        ffn_expansion_factor=2.66,
+        bias=False,
+        LayerNorm_type="WithBias",


Make enum or convert to Bool (e.g., layer_norm_use_bias).

aylward · 2025-02-02T18:48:42Z

tests/test_CABlock.py

+
+
+if __name__ == "__main__":
+    unittest.main()


Looks good .... as long as the runtime is reasonable.

I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 3db93ce I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 9693e04 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: a89f299 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 450691f I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: d0920d8 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 1a48d4d I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: fe47807 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 86155cd I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 137a7f2 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: fb17baf I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 5ff0baa I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 2566db1 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: ac4047b I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 2b74270 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 9b74533 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 1ab34f6 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 4f4c62c I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 068688f I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: e2e1070 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 35c7ee4 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: d8cb6c1 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 6d96816 I, tisalon <[email protected]>, hereby add my Signed-off-by to this commit: 8a688fb Signed-off-by: tisalon <[email protected]>

Signed-off-by: tisalon <[email protected]>

Fixes Project-MONAI#8298 ### Types of changes  - [x] Non-breaking change (fix or new feature that would not break existing functionality). - [ ] Breaking change (fix or new feature that would cause existing functionality to change). - [ ] New tests added to cover the changes. - [ ] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`. - [ ] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`. - [ ] In-line docstrings updated. - [ ] Documentation updated, tested `make html` command in the `docs/` folder. --------- Signed-off-by: YunLiu <[email protected]> Co-authored-by: Eric Kerfoot <[email protected]>

Fixes Project-MONAI#8267 . ### Description Fix channel-wise intensity normalization for integer type inputs. ### Types of changes  - [ ] Non-breaking change (fix or new feature that would not break existing functionality). - [x] Breaking change (fix or new feature that would cause existing functionality to change). - [x] New tests added to cover the changes. - [x] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`. - [x] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`. - [ ] In-line docstrings updated. - [x] Documentation updated, tested `make html` command in the `docs/` folder. --------- Signed-off-by: advcu987 <[email protected]> Signed-off-by: advcu <[email protected]> Co-authored-by: Eric Kerfoot <[email protected]>

Fixes Project-MONAI#8306 This previous api has been deprecated, update based on: https://docs.ngc.nvidia.com/api/?urls.primaryName=Private%20Artifacts%20(Models)%20API#/artifact-file-controller/downloadAllArtifactFiles ### Types of changes  - [x] Non-breaking change (fix or new feature that would not break existing functionality). - [ ] Breaking change (fix or new feature that would cause existing functionality to change). - [ ] New tests added to cover the changes. - [ ] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`. - [ ] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`. - [ ] In-line docstrings updated. - [ ] Documentation updated, tested `make html` command in the `docs/` folder. --------- Signed-off-by: YunLiu <[email protected]>

Fixes Project-MONAI#8298 ### Types of changes  - [x] Non-breaking change (fix or new feature that would not break existing functionality). - [ ] Breaking change (fix or new feature that would cause existing functionality to change). - [ ] New tests added to cover the changes. - [ ] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`. - [ ] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`. - [ ] In-line docstrings updated. - [ ] Documentation updated, tested `make html` command in the `docs/` folder. --------- Signed-off-by: YunLiu <[email protected]> Co-authored-by: Eric Kerfoot <[email protected]>

Related to Project-MONAI#8241 . ### Description A few sentences describing the changes proposed in this pull request. ### Types of changes  - [x] Non-breaking change (fix or new feature that would not break existing functionality). - [ ] Breaking change (fix or new feature that would cause existing functionality to change). - [ ] New tests added to cover the changes. - [ ] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`. - [ ] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`. - [ ] In-line docstrings updated. - [ ] Documentation updated, tested `make html` command in the `docs/` folder. --------- Signed-off-by: Yiheng Wang <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

) Fixes Project-MONAI#8298. ### Description This includes the tests for the `compressor` argument when testing with Zarr before version 3.0 when this argument was deprecated. A fix to upgrade the version of `pycln` used is also included. The version of PyTorch is also fixed to below 2.6 to avoid issues with misuse of `torch.load` which must be addressed later. ### Types of changes  - [x] Non-breaking change (fix or new feature that would not break existing functionality). - [ ] Breaking change (fix or new feature that would cause existing functionality to change). - [ ] New tests added to cover the changes. - [ ] Integration tests passed locally by running `./runtests.sh -f -u --net --coverage`. - [ ] Quick tests passed locally by running `./runtests.sh --quick --unittests --disttests`. - [ ] In-line docstrings updated. - [ ] Documentation updated, tested `make html` command in the `docs/` folder. --------- Signed-off-by: Eric Kerfoot <[email protected]>

…ns and simplify ValueError message in pixelunshuffle

…n Restormer model and update assert in forward layer to support 3D images

… Restormer class.

… forward method

…ument descriptions and error handling details.

…ted changes Signed-off-by: tisalon <[email protected]>

Signed-off-by: tisalon <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: tisalon <[email protected]>

phisanti and others added 24 commits January 15, 2025 09:18

Add new pixel unshuffle for SubPixelDownsample class

3db93ce

Add unit test for pixelunshuffle

9693e04

Add DownSample Modes

a89f299

expand pixelunshuffle for 3D

450691f

increase testing for pixelunshuffle

d0920d8

expand pixelunshuffle for 3D images

1a48d4d

add SubpixelDownsample and tests

fe47807

Add DownSample Class

86155cd

Add tests for Downsample

137a7f2

add exports to __init__

fb17baf

Include test to compare with Conv + unshuffle from original restormer

5ff0baa

remove relative imports

2566db1

Create restormer with Downsampler/Upsampler using monai implementation

ac4047b

Add channel attention block

2b74270

add assembled restormer with MONAI convs for 3D

9b74533

restormer adapted for 2D/3D

1ab34f6

Add unit test for CABlock and the FeedForward layers

4f4c62c

remove relative imports

068688f

rename restormer

e2e1070

add unit test restormer

35c7ee4

Update documentation and imports for CABlock and FeedForward; add Dow…

d8cb6c1

…nsample class alias

Add licence to pixel_unshuffle test

6d96816

Refactor imports and clean up whitespace in utils and test files and …

8a688fb

…pass ./runtests.sh -f -u --net --coverage

[pre-commit.ci] auto fixes from pre-commit.com hooks

acb818d

for more information, see https://pre-commit.ci

ericspod requested review from ericspod, Nic-Ma, KumoLiu, yiheng-wang-nv and Can-Zhao January 24, 2025 12:15

ericspod reviewed Jan 24, 2025

View reviewed changes

ericspod requested a review from aylward January 24, 2025 13:35

aylward requested changes Feb 2, 2025

View reviewed changes

phisanti and others added 28 commits February 7, 2025 13:22

add optional_import to downsample block test

c7b1af4

Signed-off-by: tisalon <[email protected]>

rename args and fix imports

8faa5da

Sync dev branch with upstream MONAI changes

61efefb

Clarify input tensor shape in pixelshuffle and pixelunshuffle functio…

091887b

…ns and simplify ValueError message in pixelunshuffle

Refactor downsample mode checks to use enum values for clarity

5d162d0

fix optiona import

f520e99

Refactor layer normalization parameters for consistency and clarity i…

39d1edf

…n Restormer model and update assert in forward layer to support 3D images

Enhance documentation for MDTATransformerBlock, OverlapPatchEmbed and…

5b3d4e1

… Restormer class.

run ./runtests.sh --autofix to check formatting

1683b14

Refactor OverlapPatchEmbed to inherit from Convolution and streamline…

232be1c

… forward method

Enhance documentation for FeedForward and CABlock classes, adding arg…

d1df8e6

…ument descriptions and error handling details.

code formatting

78ce56b

Update args naming in unit restormer test for consistency with sugges…

64b203d

…ted changes Signed-off-by: tisalon <[email protected]>

Fix optional import

ce15886

require einops for all tests

30fad17

require einops also for test_restormer

1079d8c

Signed-off-by: tisalon <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

b2b3ddf

for more information, see https://pre-commit.ci

remove relative impots

174e968

Signed-off-by: tisalon <[email protected]>

fix capitalisation in DownSample documentation networks.rts

e15a815

Signed-off-by: tisalon <[email protected]>

fix capitalisation in SubpixelDownsample documentation

d53d97d

Signed-off-by: tisalon <[email protected]>

formatting

cae7d96

Signed-off-by: tisalon <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Restormer Implementation #8312

Restormer Implementation #8312

phisanti commented Jan 23, 2025

ericspod left a comment

ericspod Jan 24, 2025

ericspod Jan 24, 2025

phisanti Feb 7, 2025

ericspod Jan 24, 2025

ericspod Jan 24, 2025

ericspod Jan 24, 2025

phisanti Feb 7, 2025

ericspod Jan 24, 2025

phisanti Feb 7, 2025

ericspod Jan 24, 2025

phisanti Feb 7, 2025

aylward left a comment

aylward Feb 2, 2025

ericspod Feb 3, 2025

phisanti Feb 7, 2025

aylward Feb 2, 2025

phisanti Feb 7, 2025

aylward Feb 2, 2025

phisanti Feb 7, 2025

aylward Feb 2, 2025

aylward Feb 2, 2025

	f"All spatial dimensions must be divisible by factor {factor}. " f"Got spatial dimensions: {input_size[2:]}"
	f"All spatial dimensions must be divisible by {factor}, spatial shape is: {input_size[2:]}"

	elif down_mode == "convgroup":
	elif down_mode == DownsampleMode.CONVGROUP:

	elif down_mode == "pixelunshuffle":
	elif down_mode == DownsampleMode.PIXELSHUFFLE:

Restormer Implementation #8312

Are you sure you want to change the base?

Restormer Implementation #8312

Conversation

phisanti commented Jan 23, 2025

Description

Types of changes

ericspod left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aylward left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment