
[Fix] Improve dim_size handling in SetTransformerAggregation to prevent CUDA crash #10220


Open · wants to merge 6 commits into pyg-team/pytorch_geometric:master from KAVYANSHTYAGI:fix/set-transformer-aggregation-index-check

Conversation

KAVYANSHTYAGI commented:

This PR improves the robustness of SetTransformerAggregation by:

  • Automatically setting dim_size = index.max() + 1 if dim_size is not provided.
  • Raising a clear error if index.max() >= dim_size to avoid CUDA crashes during evaluation.

This is especially helpful for datasets such as PPI, where data.batch may be missing. It replaces hard-to-debug GPU errors with clear, early validation.
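
A minimal, self-contained sketch of the behavior described above; _validate_dim_size is a hypothetical helper for illustration, not the actual patch:

    import torch
    from typing import Optional

    def _validate_dim_size(index: torch.Tensor, dim_size: Optional[int]) -> int:
        # Fall back to the group count implied by the index tensor, then
        # validate before any GPU-side indexing (illustrative only).
        if dim_size is None:
            dim_size = int(index.max()) + 1
        if int(index.max()) >= dim_size:
            raise ValueError(
                f"index.max() = {int(index.max())} is out of range for "
                f"dim_size = {dim_size}; ensure `data.batch` is set or pass "
                f"`dim_size` explicitly.")
        return dim_size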

[Fix] Add dim_size validation and fallback to SetTransformerAggregation
@@ -94,6 +94,15 @@ def forward(
        max_num_elements: Optional[int] = None,
    ) -> Tensor:

        if dim_size is None:
Member:

I think this is already handled in to_dense_batch.
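
For context, to_dense_batch in torch_geometric.utils already infers the batch size when it is not given, roughly along these lines (simplified sketch of the upstream logic, not a verbatim copy; infer_batch_size is a hypothetical helper):

    import torch

    def infer_batch_size(batch: torch.Tensor, batch_size=None) -> int:
        # When batch_size is not passed, to_dense_batch derives it from the
        # largest batch index, so an explicit fallback in the caller is
        # redundant (simplified; upstream details differ).
        if batch_size is None:
            batch_size = int(batch.max()) + 1 if batch.numel() > 0 else 1
        return batch_size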

Comment on lines 100 to 104
        if int(index.max()) >= dim_size:
            raise ValueError(
                f"SetTransformerAggregation error: index.max() = {int(index.max())}, "
                f"but dim_size = {dim_size}. This causes an indexing error on GPU. "
                f"Ensure data.batch is set or dim_size is passed explicitly.")
Member:

This leads to a device sync, so we should avoid this.
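
For reference, the sync comes from converting a CUDA tensor to a Python number: int(index.max()) (like .item()) copies the value to the host and blocks until the GPU work producing it has finished, e.g.:

    import torch

    index = torch.randint(0, 8, (1024,), device='cuda')  # example data
    dim_size = 8
    # int()/.item() on a CUDA tensor transfers the value to the CPU and
    # waits for the device, which is the sync the review points out.
    if int(index.max()) >= dim_size:
        raise ValueError('index out of range for dim_size')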

Fix: Remove device sync and dim_size fallback in SetTransformerAggregation

- Removed redundant dim_size = index.max() + 1 logic (handled in to_dense_batch).
- Added GPU-safe index validation to avoid CUDA crashes.
@KAVYANSHTYAGI (Author) left a comment:

Thank you for the feedback!

I've removed the fallback dim_size = index.max() + 1 to avoid redundancy with to_dense_batch, as suggested.

Also eliminated device sync by replacing the .max() check with a GPU-safe tensor comparison using (index >= dim_size).any().

Let me know if any further simplification is preferred!
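
A sketch of the revised check as described above (variable names assumed from the earlier snippet; note that using the result in a Python `if` still reads one boolean back to the host, so a fully asynchronous check would need something like torch._assert_async):

    import torch

    def check_index(index: torch.Tensor, dim_size: int) -> None:
        # The elementwise comparison and the any() reduction stay on the
        # device; only a single boolean is read back when the `if` branches
        # (hypothetical helper for illustration).
        if bool((index >= dim_size).any()):
            raise ValueError(
                f"Found an index >= dim_size ({dim_size}); ensure `data.batch` "
                f"is set or pass `dim_size` explicitly.")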
