How do I switch to my own dataset? #26

Open
captain2017 opened this issue Jul 30, 2021 · 5 comments

Comments

@captain2017

python -m examples.text_keyphrase
How do I switch to my own dataset? Can't I just append my data to the end of the file, or does it need annotations or keyword scores?

@changzong
Contributor

Yes, you can swap it. Just make sure your data follows the same format as the sample data.

@captain2017
Author

Does every entry have to be annotated?

@captain2017
Author

Can't I just append to the end? Also, if I want to run prediction on new samples, how should I call it?

@changzong
Contributor

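No, annotation isn't needed. For the keyphrase extraction task, the dataset only has to match the format of the file data/patent-text/train: each line holds the data for one patent, formatted as title@@abstract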
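For anyone preparing their own file, here is a minimal sketch of writing and checking data in the `title@@abstract` layout described above. The helper names (`to_line`, `write_dataset`, `check_dataset`) are hypothetical and not part of this project. The UTF-8 encoding, the whitespace collapsing, and the rule that both fields must be non-empty are assumptions rather than documented requirements.

```python
from pathlib import Path

SEPARATOR = "@@"  # delimiter between title and abstract, per the format above


def to_line(title: str, abstract: str) -> str:
    """Join one document into a single `title@@abstract` line (hypothetical helper)."""
    # Collapse newlines and repeated whitespace so each document stays on one line.
    title = " ".join(title.split())
    abstract = " ".join(abstract.split())
    if not title or not abstract:
        # Assumption: both fields are expected to be non-empty.
        raise ValueError("title and abstract must both be non-empty")
    if SEPARATOR in title or SEPARATOR in abstract:
        raise ValueError("a field contains the '@@' separator")
    return f"{title}{SEPARATOR}{abstract}"


def write_dataset(docs, path):
    """Write (title, abstract) pairs to `path`, one document per line."""
    lines = [to_line(title, abstract) for title, abstract in docs]
    # Assumption: the file is read as UTF-8 text.
    Path(path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)


def check_dataset(path):
    """Return 1-based numbers of lines that are not exactly `title@@abstract`."""
    bad = []
    with open(path, encoding="utf-8") as f:
        for number, line in enumerate(f, start=1):
            parts = line.rstrip("\n").split(SEPARATOR)
            if len(parts) != 2 or not all(part.strip() for part in parts):
                bad.append(number)
    return bad
```

Running `check_dataset` on a file you have appended to by hand gives the line numbers that would not split into exactly two fields, which is a quick way to catch stray line breaks inside an abstract.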

@captain2017
Author

The valid file doesn't seem to get predicted. All the output is prediction results for the train file.
