Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training Llava-OV error #2843

Open
rover5056 opened this issue Jan 2, 2025 · 4 comments
Open

Training Llava-OV error #2843

rover5056 opened this issue Jan 2, 2025 · 4 comments

Comments

@rover5056
Copy link

image

模型结构和 llava-ov 完全一样,LLM 用的是 qwen2.5,load 的时候会报错

AttributeError: Qwen2TokenizerFast has no attribute tokenizer. Did you mean: '_tokenizer'?

求问是需要修改 config 么

@Jintao-Huang Jintao-Huang added the bug Something isn't working label Jan 3, 2025
@Jintao-Huang
Copy link
Collaborator

shell发我一下

@rover5056
Copy link
Author

rover5056 commented Jan 3, 2025

@Jintao-Huang

就是这个~ 用的是 Ivy-VL-llava 的 ckpt
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \ NNODES=2 \ NODE_RANK=$RANK \ MASTER_ADDR=$MASTER_ADDR \ MASTER_PORT=$MASTER_PORT \ NPROC_PER_NODE=8 \ swift sft \ --model /mnt/csp/mmvision/share/pretrained_models/Ivy-VL-llava \ --train_type full \ --freeze_aligner false \ --freeze_llm false \ --freeze_vit false \ --model_type llava_onevision_hf \ --output_dir wemm2_qwen2_5_3B_0102_Ivy \ --deepspeed zero2 \ --per_device_train_batch_size 4 \ --per_device_eval_batch_size 4 \ --warmup_ratio 0.03 \ --dataset_num_proc 32 \ --save_steps 0.3 \ --num_train_epochs 1 \ --learning_rate 1e-5 \ --torch_dtype bfloat16 \ --max_length 5120 \ --truncation_strategy delete \ --gradient_accumulation_steps 16 \ --eval_steps 0.2 \ --disable_tqdm True \ --lazy_tokenize True \ --report_to wandb \ --dataset data_anns/665k_filtered.json

@Jintao-Huang Jintao-Huang removed the bug Something isn't working label Jan 3, 2025
@Jintao-Huang
Copy link
Collaborator

这里的processor为什么是个tokenizer哇,你检查一下这里

@rover5056
Copy link
Author

rover5056 commented Jan 3, 2025

用的是 qwen2.5 的 llm 看着倒也没错? 是这个 repo
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants