Clean mt5 #419

xiezipeng-ML · 2022-11-03T08:10:02Z

Clean up mt5: enable fuse_multihead_att and fuse_softmax by default, and comment out the code related to model_type='t5'.

xiezipeng-ML · 2022-11-04T10:19:30Z

projects/T5/models/attention.py

- # [bsz, tgt_len, num_heads, head_size] -> [bsz, tgt_len, num_heads * head_size]
- # SBP sign: [S(0), S(2)]
- # [S(0), S(2)] x [B, S(0)] = [S(0), P] -> [S(0), B]
+ context = flow._C.transpose(context, perm=(2, 0, 1, 3))


On the reason for the transpose here, which came up in today's discussion:

attention_scores, value = flow._C.fused_self_attention(...) returns two tensors: attention_scores with shape [batch_size, num_head, seq_len1, seq_len2] and value with shape [bsz, num_head, seq_len, head_size], so dimension 0 is batch_size.

context is then computed as context = flow.matmul(attention_weights, value), so its shape is [bsz, num_head, seq_len, head_size], and a transpose is needed at the end. The code before this change also had a transpose, context = context.transpose(1, 2); the only difference is that the new call, context = flow._C.transpose(context, perm=(2, 0, 1, 3)), permutes three dimensions instead of swapping two.
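The shape bookkeeping can be checked with a small sketch. It uses NumPy as a stand-in for OneFlow, on the assumption that np.matmul, np.swapaxes and np.transpose follow the same axis semantics as flow.matmul, Tensor.transpose(1, 2) and flow._C.transpose(..., perm=...). The sizes and variable names are hypothetical and chosen only for illustration; this is not the code from projects/T5/models/attention.py.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
bsz, num_heads, seq_len, head_size = 2, 4, 5, 8

rng = np.random.default_rng(0)
attention_weights = rng.random((bsz, num_heads, seq_len, seq_len))
value = rng.random((bsz, num_heads, seq_len, head_size))

# Batched matmul over the last two axes:
# [bsz, num_heads, seq_len, seq_len] x [bsz, num_heads, seq_len, head_size]
# -> [bsz, num_heads, seq_len, head_size]
context = np.matmul(attention_weights, value)

# Old layout change, equivalent to context.transpose(1, 2):
# -> [bsz, seq_len, num_heads, head_size]
old_context = np.swapaxes(context, 1, 2)

# New layout change, equivalent to perm=(2, 0, 1, 3):
# -> [seq_len, bsz, num_heads, head_size]
new_context = np.transpose(context, (2, 0, 1, 3))

# Both results hold the same values; they differ only in whether the
# batch axis or the sequence axis comes first.
same_values = np.array_equal(new_context, np.swapaxes(old_context, 0, 1))

print(context.shape, old_context.shape, new_context.shape, same_values)
```

Under these assumptions, the new permutation produces the same values as the old transpose with the sequence axis moved in front of the batch axis.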


xiezipeng-ML added 2 commits November 3, 2022 07:40

fuse_multihead + fues_softmax

a358f32

clean mt5

6cb2e11

xiezipeng-ML requested review from strint, chengtbf, CPFLAME and ouyangyu November 3, 2022 08:10

This comment was marked as resolved.


fix logits

fc949b3

xiezipeng-ML commented Nov 4, 2022


xiezipeng-ML mentioned this pull request Nov 7, 2022

Use fused gelu mul #420

Closed

xiezipeng-ML and others added 4 commits November 8, 2022 12:15

add model unitest

039c5e7

refine

13bd819

use fuse gelu mul

d78d88e

reformat

4f1f3b5

xiezipeng-ML requested a review from leaves-zwx November 8, 2022 09:26

xiezipeng-ML and others added 2 commits November 15, 2022 11:54

update rms_norm

8a30a80

Merge branch 'main' into clean_mt5

9a1dfc9

xiezipeng-ML requested a review from oneflow-ci-bot November 17, 2022 08:01

reformat

2fe829a

xiezipeng-ML requested review from oneflow-ci-bot and removed request for oneflow-ci-bot November 17, 2022 08:06

strint and others added 3 commits November 17, 2022 17:29

Fix clean mt5 (#430)

5cfff4a

* fix api * fix api

refine

7da3a99

refine trannspose

4116090
