Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

训练qwen1.5-moe-A2.7B-chat速度缓慢,GPU利用率低 #868

Closed
yangzhipeng1108 opened this issue May 6, 2024 · 1 comment
Closed

Comments

@yangzhipeng1108
Copy link

yangzhipeng1108 commented May 6, 2024

CUDA_VISIBLE_DEVICES=0
python3 llm_sft.py
--model_type qwen1half-moe-a2_7b-chat
--model_id_or_path /root/yovole/qwen/Qwen1.5-MoE-A2.7B-Chat
--sft_type lora
--tuner_backend swift
--dtype AUTO
--output_dir output
--dataset dureader-robust-zh
--train_dataset_sample 10000
--num_train_epochs 1
--max_length 1024
--check_dataset_strategy warning
--lora_rank 8
--lora_alpha 32
--lora_dropout_p 0.05
--lora_target_modules ALL
--gradient_checkpointing true
--batch_size 1
--weight_decay 0.1
--learning_rate 1e-4
--gradient_accumulation_steps 16
--max_grad_norm 0.5
--warmup_ratio 0.03
--eval_steps 100
--save_steps 100
--save_total_limit 2
--logging_steps 10
--use_flash_attn true
--self_cognition_sample 1000
--custom_train_dataset_path /root/yovole/qwen/data/alpaca-gpt4-data-zh/alpaca_gpt4_data_zh.json
--custom_val_dataset_path /root/yovole/qwen/data/alpaca-gpt4-data-zh/alpaca_gpt4_data_zh.json
--model_name 卡卡罗特
--model_author 陶白白

image

image

@tastelikefeet
Copy link
Collaborator

这个慢的不正常,看下模型device,是不是有offloading到cpu的情况

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants