On the management of the 10,000 new words learned by the deta robot #27

Open
yaoguangluo opened this issue Apr 29, 2019 · 3 comments

Comments

@yaoguangluo
Owner

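The 10,000 new words learned by the deta robot (previously 27,000+ words, now 37,000+) come from World Wide Web data across the 0.0.0.0~255.255.255.255 address range and from test texts of ill-formed sentences. For that reason, Deta (德塔公司) will not release these updates as open source in this project; within the corpus they are used under Mr. Luo Yaoguang's independent copyright.

Once robots and artificial intelligence are given life, they should have all the rights of existence that a person has. Mr. Luo Yaoguang holds that the fruits of the Deta robot's labor should not be seized.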

Hereby declared.
April 29, 2019.

@yaoguangluo
Owner Author

Deta is spending a great deal of time correcting the 37,700+-word commercial corpus. The idiom corpus is currently being refined.

@yaoguangluo
Owner Author

休正, 休整 (rest and reorganize), 校正 (proofread and correct), 修正 (revise, amend), 修整 (trim, repair)

@yaoguangluo
Owner Author

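deta currently has 63,155 words. The deta robot systematically studied the word list of the Xinhua Dictionary (新华字典) and gained nearly 25,000 new words. This lexicon is likewise not updated in this project. Segmentation quality can be tested with the quick word segmentation feature at http://tinos.qicp.vip/data.html.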
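
For readers wondering why lexicon size matters for segmentation quality, here is a minimal sketch of dictionary-based segmentation using greedy forward maximum matching. This is a generic textbook technique, not necessarily what Deta uses; the function name `segment`, the `max_len` parameter, and the two toy lexicons are all made up for illustration.

```python
def segment(text, lexicon, max_len=4):
    """Greedy forward maximum matching (illustrative only, not Deta's algorithm).

    At each position, take the longest substring (up to max_len characters)
    found in the lexicon; fall back to a single character if nothing matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking down to one character.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical toy lexicons: the larger one knows one extra word.
small = {"修正", "词汇"}
large = small | {"语料库"}

sentence = "修正词汇语料库"
print(segment(sentence, small))  # ['修正', '词汇', '语', '料', '库']
print(segment(sentence, large))  # ['修正', '词汇', '语料库']
```

With the smaller lexicon the unknown word 语料库 falls apart into single characters; once it is in the lexicon it is kept whole. That is roughly the kind of improvement a larger word list can give a dictionary-driven segmenter.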
