[LLM] support QWen1.5-Moe #8338

DrownFish19 · 2024-04-28T08:45:04Z

PR types

New features

PR changes

Models

Description

add QWen1.5 Moe model.
support same prefix for different models, such as QWen and QWen2Moe with same prefix QWen. The longest name will match each model name before others.
support sft and lora.

…-moe

paddle-bot · 2024-04-28T08:45:09Z

Thanks for your contribution!

…P into dev_add_qwen1.5-moe

codecov · 2024-05-06T03:46:48Z

Codecov Report

Attention: Patch coverage is 66.44370% with 301 lines in your changes are missing coverage. Please review.

Project coverage is 54.08%. Comparing base (0087c4a) to head (d57a5b1).

❗ Current head d57a5b1 differs from pull request most recent head bfb65a1

Please upload reports for the commit bfb65a1 to get more accurate results.

Files	Patch %	Lines
paddlenlp/transformers/qwen2moe/modeling.py	72.29%	197 Missing ⚠️
paddlenlp/transformers/qwen2moe/tokenizer.py	22.38%	104 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8338      +/-   ##
===========================================
+ Coverage    53.96%   54.08%   +0.12%     
===========================================
  Files          618      622       +4     
  Lines        96827    97722     +895     
===========================================
+ Hits         52256    52857     +601     
- Misses       44571    44865     +294

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

DesmonDay · 2024-05-24T07:34:22Z

paddlenlp/transformers/qwen2moe/__init__.py

+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+from .configuration import QWen2MoeConfig


QWen2MoEConfig会不会更好，把Moe都改成MoE。

DesmonDay · 2024-05-24T09:00:55Z

paddlenlp/transformers/qwen2moe/modeling.py

@@ -0,0 +1,1580 @@
+# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


2023 -> 2024，都改掉吧

ZHUI · 2024-05-08T11:33:23Z

llm/qwen2moe/lora_argument.json

+{
+ "model_name_or_path": "qwen/Qwen1.5-MoE-A2.7B",
+ "dataset_name_or_path": "./data",
+ "output_dir": "./checkpoints/qwen2moe_lora_ckpts",


确认是否ok，并同步更新 readme 文档

ZHUI · 2024-05-08T11:34:31Z

paddlenlp/transformers/qwen2moe/__init__.py

+from .configuration import QWen2MoeConfig
+from .modeling import QWen2MoeForCausalLM
+from .tokenizer import QWen2MoeTokenizer


Suggested change

from .configuration import QWen2MoeConfig

from .modeling import QWen2MoeForCausalLM

from .tokenizer import QWen2MoeTokenizer

from .configuration import *

from .modeling import *

from .tokenizer import*

ZHUI · 2024-05-08T11:35:09Z

paddlenlp/transformers/__init__.py

+from .qwen2moe.modeling import *
+from .qwen2moe.configuration import *
+from .qwen2moe.tokenizer import *


Suggested change

from .qwen2moe.modeling import *

from .qwen2moe.configuration import *

from .qwen2moe.tokenizer import *

from .qwen2moe import *

ZHUI · 2024-05-08T11:37:21Z

tests/transformers/qwen2moe/__init__.py

@@ -0,0 +1,13 @@
+# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


这个文件需要吗？

DrownFish19 added 23 commits April 17, 2024 10:58

add Qwen2Moe

36ab9a7

update default config

3913e11

Merge remote-tracking branch 'paddlenlp/develop' into dev_add_qwen1.5…

0aa1aca

…-moe

update QWen2Moe modeling

a29e90d

update modeling

d514dff

update ckpt name

1e98323

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

f81bb43

support same prefix model name for auto modeling

37dd2d5

update qwen2moe testing

d12938a

update qwen2moe modeling and config

8cc49fc

update qwen2moe import

9c8222e

fix mlp hidden_size

4d6ff87

update qkv bias convert

f350a2f

update modeling init_weight

c53690d

update _get_name_mappings

9d12995

update _get_name_mappings and _init_weight

dba0f74

add tokenizer

e487606

update modeling

cd9c753

update modeling

10407c4

update tokenizer

beb0f4c

update modeling and tokenizer

beefee9

fix index_add_ error

82ba345

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

d522ee4

DrownFish19 added 4 commits April 28, 2024 11:08

fix

4a1b2e3

Merge branch 'dev_add_qwen1.5-moe' of github.com:DrownFish19/PaddleNL…

526a9db

…P into dev_add_qwen1.5-moe

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

0c9d5ec

update comments

2bb3aba

update lora weights

f203983

add todo

58af3ec

ZHUI closed this May 24, 2024

ZHUI reopened this May 24, 2024

DesmonDay reviewed May 24, 2024

View reviewed changes

DrownFish19 added 6 commits May 29, 2024 10:49

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

c766eb5

update Copyright

5ddc326

update Moe to MoE

de1db67

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

10a194c

update comment

87f0276

update Copyright

8d9970b

ZHUI reviewed Jun 3, 2024

View reviewed changes

DrownFish19 added 3 commits June 3, 2024 15:45

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

89994a6

update readme and json

d57a5b1

update __init__.py

bfb65a1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLM] support QWen1.5-Moe #8338

[LLM] support QWen1.5-Moe #8338

DrownFish19 commented Apr 28, 2024 •

edited

paddle-bot bot commented Apr 28, 2024

codecov bot commented May 6, 2024 •

edited

DesmonDay May 24, 2024

DrownFish19 May 29, 2024

DesmonDay May 24, 2024

DrownFish19 May 29, 2024

ZHUI May 8, 2024

ZHUI May 8, 2024

ZHUI May 8, 2024

ZHUI May 8, 2024

		@@ -0,0 +1,1580 @@
		# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.

		@@ -0,0 +1,13 @@
		# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.

[LLM] support QWen1.5-Moe #8338

Are you sure you want to change the base?

[LLM] support QWen1.5-Moe #8338

Conversation

DrownFish19 commented Apr 28, 2024 • edited

PR types

PR changes

Description

paddle-bot bot commented Apr 28, 2024

codecov bot commented May 6, 2024 • edited

Codecov Report

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DrownFish19 commented Apr 28, 2024 •

edited

codecov bot commented May 6, 2024 •

edited