Trying to fine-tune GPT2 to generate negative samples #124
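I have very few negative samples (only about 150 dialogues, each no longer than 100 characters, mixing Chinese and English), so I want to generate more by fine-tuning GPT2.
After preprocessing, running train.py fails with the error below. Could someone help me find where the problem is?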
Traceback (most recent call last):
  File "train.py", line 427, in <module>
    main()
  File "train.py", line 423, in main
    train(model, logger, train_dataset, validate_dataset, args)
  File "train.py", line 268, in train
    train_dataloader = DataLoader(
  File "/home/lee/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 351, in __init__
    sampler = RandomSampler(dataset, generator=generator)  # type: ignore[arg-type]
  File "/home/lee/.local/lib/python3.8/site-packages/torch/utils/data/sampler.py", line 107, in __init__
    raise ValueError("num_samples should be a positive integer "
ValueError: num_samples should be a positive integer value, but got num_samples=0
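
The ValueError above means the dataset passed to DataLoader has length zero, so RandomSampler has nothing to draw from. One possible cause, and this is only a guess because train.py is not shown here, is that the script holds out a fixed number of samples for validation and that number is larger than my 150 dialogues, which leaves the training split empty. A minimal sketch of that situation, with a check that fails early with a clearer message (`split_train_val` and `val_num` are hypothetical names, not taken from train.py):

```python
def split_train_val(samples, val_num):
    """Hold out the first `val_num` samples for validation; the rest is training data.

    Hypothetical helper for illustration only; the real split logic in
    train.py may differ.
    """
    if val_num < 0:
        raise ValueError("val_num must be non-negative")
    val = samples[:val_num]
    train = samples[val_num:]
    if len(train) == 0:
        # This is the condition that would later surface as num_samples=0.
        raise ValueError(
            f"training split is empty: {len(samples)} samples, val_num={val_num}"
        )
    return train, val


# Stand-in for the ~150 preprocessed dialogues.
samples = [f"dialogue {i}" for i in range(150)]

# A hold-out size smaller than the dataset leaves data to train on.
train, val = split_train_val(samples, val_num=15)
print(len(train), len(val))
```

If this is what is happening, lowering the hold-out size below the dataset size, or checking `len(train_dataset)` before building the DataLoader, should confirm it.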