How to transpose a tensor and have the transpose reflected in contiguous memory with CUDA backend #684

balisujohn · 2024-01-07T06:29:44Z

I have a contiguous tensor of shape [18,64,16,4]. If I print the first three and last three values in memory, I get:

0.00137997
0.00800999
0.0252284
0.18625
-0.075262
-0.00594251

I want to transpose this tensor and have the transpose reflected in contiguous memory. So I try the code:

KQV = ggml_transpose(ctx0, KQV);
KQV = ggml_cont_3d(ctx0,ggml_view_1d(ctx0, KQV, 64*18*16*4,0),64,18,64);

but I get the same values when printing the first three and last three memory locations:

0.00137997
0.00800999
0.0252284
0.18625
-0.075262
-0.00594251

How can I get a transpose of the first two dimensions of a 4D tensor that is actually reflected in contiguous memory? I should add that this is with the CUDA backend, where ggml_cont_4d() doesn't work with 4D tensors.
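
A likely explanation for the unchanged values, assuming ggml_transpose returns a view that only swaps the shape and stride metadata: taking a flat 1D view of that result reads the underlying buffer in its original order, so the later copy reproduces the same memory. The plain-C sketch below illustrates the idea without ggml; `view2d`, `view_transpose`, and `view_cont` are hypothetical names made up for this illustration, not ggml functions.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical minimal 2D "tensor view": shape plus element strides
 * over a shared buffer. Not a ggml type. */
typedef struct {
    const float *data; /* underlying buffer, never reordered by a view */
    size_t ne[2];      /* number of elements; ne[0] varies fastest */
    size_t nb[2];      /* stride of each dimension, in elements */
} view2d;

/* Transposing a view swaps shape and strides; the buffer is untouched. */
static view2d view_transpose(view2d v) {
    view2d t = { v.data, { v.ne[1], v.ne[0] }, { v.nb[1], v.nb[0] } };
    return t;
}

/* Materialize a view into dst as a contiguous array by walking its strides.
 * dst must not alias the source buffer. */
static void view_cont(view2d v, float *dst) {
    assert(dst != v.data);
    for (size_t i1 = 0; i1 < v.ne[1]; i1++) {
        for (size_t i0 = 0; i0 < v.ne[0]; i0++) {
            dst[i1 * v.ne[0] + i0] = v.data[i1 * v.nb[1] + i0 * v.nb[0]];
        }
    }
}
```

Reading `t.data` directly after `view_transpose` yields the original order; only a stride-aware copy such as `view_cont` changes what ends up in memory.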

Answered by balisujohn

balisujohn · 2024-01-07T06:37:17Z

Author

This strategy seems to work actually:

            KQV = ggml_reshape_3d(ctx0, KQV, 18,64,64);
            KQV = ggml_permute(ctx0, KQV, 1,0,2,3);
            KQV = ggml_cont_3d(ctx0, KQV, 64,18,64);
            KQV = ggml_reshape_4d(ctx0, KQV, 64,18,16,4);
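
For reference, the effect of the four calls above can be sketched in plain C, assuming the first listed dimension is the fastest-varying one in memory. Because the two trailing dimensions (16 and 4) are not reordered, they can be collapsed into a single batch axis of 64, which is what makes the 3D route possible. The function name `swap_first_two_dims` is hypothetical and only illustrates the index arithmetic; it is not part of ggml.

```c
#include <assert.h>
#include <stddef.h>

/* Swap the two fastest-varying dimensions of a contiguous 4D tensor.
 * src has shape (ne0, ne1, ne2, ne3) with ne0 fastest;
 * dst receives shape (ne1, ne0, ne2, ne3), also contiguous.
 * src and dst must not overlap. */
static void swap_first_two_dims(const float *src, float *dst,
                                size_t ne0, size_t ne1,
                                size_t ne2, size_t ne3) {
    assert(src != dst);
    const size_t batch = ne2 * ne3; /* trailing dims collapsed, like the 3D reshape */
    const size_t plane = ne0 * ne1; /* elements in one (ne0, ne1) slice */
    for (size_t b = 0; b < batch; b++) {
        const float *s = src + b * plane;
        float *d = dst + b * plane;
        for (size_t i1 = 0; i1 < ne1; i1++) {
            for (size_t i0 = 0; i0 < ne0; i0++) {
                /* element (i0, i1) of the source slice lands at (i1, i0) */
                d[i0 * ne1 + i1] = s[i1 * ne0 + i0];
            }
        }
    }
}
```

With the shapes from this thread, the call would be `swap_first_two_dims(src, dst, 18, 64, 16, 4)`, giving a result of shape (64, 18, 16, 4).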

2 replies

balisujohn Jan 7, 2024
Author

~~(though I'm still not getting exactly the transpose I want, even when I tried 1,0,2,3, but that's a separate issue)~~
Updated the version above so it now works correctly!

ggerganov Jan 7, 2024
Maintainer

KQV = ggml_cont(ctx0, ggml_transpose(ctx0, KQV));

should also work
