Fine-tuning GPT2 to generate negative samples #124

Open
jazzlee008 opened this issue Oct 8, 2023 · 0 comments

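I have very few negative samples (only about 150 dialogues, each no longer than 100 characters, in mixed Chinese and English), so I want to fine-tune GPT2 to generate more of them.

After preprocessing, running train.py fails with the error below. Where does the problem come from? Any help is appreciated.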

Traceback (most recent call last):
  File "train.py", line 427, in <module>
    main()
  File "train.py", line 423, in main
    train(model, logger, train_dataset, validate_dataset, args)
  File "train.py", line 268, in train
    train_dataloader = DataLoader(
  File "/home/lee/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 351, in __init__
    sampler = RandomSampler(dataset, generator=generator)  # type: ignore[arg-type]
  File "/home/lee/.local/lib/python3.8/site-packages/torch/utils/data/sampler.py", line 107, in __init__
    raise ValueError("num_samples should be a positive integer "
ValueError: num_samples should be a positive integer value, but got num_samples=0
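
The last line of the traceback means the dataset handed to DataLoader has length 0, so RandomSampler has nothing to draw from. One possible cause, and this is only a guess since train.py is not shown here, is a train/validation split that holds out more samples than the roughly 150 available, which would leave the training set empty. The sketch below illustrates that failure mode in plain Python; `split_train_val` and `val_num` are hypothetical names for illustration, not taken from the project.

```python
def split_train_val(samples, val_num):
    """Hold out the first `val_num` samples for validation.

    Hypothetical helper: the real split in train.py may look different.
    """
    val = samples[:val_num]
    train = samples[val_num:]
    if len(train) == 0:
        # Fail early with a readable message instead of letting
        # DataLoader / RandomSampler fail later with num_samples=0.
        raise ValueError(
            f"empty training set: {len(samples)} samples, val_num={val_num}"
        )
    return train, val


# Roughly 150 dialogues, as described in the report above.
samples = [f"dialogue {i}" for i in range(150)]

# A hold-out smaller than the dataset leaves data to train on.
train, val = split_train_val(samples, val_num=15)
print(len(train), len(val))  # 135 15
```

If the split in train.py works this way, a hold-out size larger than the dataset would leave nothing to train on. Printing the length of the training dataset just before the DataLoader call would confirm or rule this out.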
