Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

分词优化:将UP主名字作为一个完整的词加入词库 #62

Open
ljhcage opened this issue Jul 15, 2023 · 4 comments
Open

分词优化:将UP主名字作为一个完整的词加入词库 #62

ljhcage opened this issue Jul 15, 2023 · 4 comments

Comments

@ljhcage
Copy link

ljhcage commented Jul 15, 2023

有很多UP主的名字很容易被拆散成多个词,建议在分词时将当前UP主的名字加入词库避免被拆散

@gaogaotiantian
Copy link
Owner

很遗憾的是,分词本身是不受控的,是浏览器功能,没办法给词库。

@ljhcage
Copy link
Author

ljhcage commented Jul 16, 2023

那在输入分词器前,把描述和标题中UP主的名字用引号引起来,会不会让分词器更好地识别UP主的名字

@ljhcage
Copy link
Author

ljhcage commented Jul 16, 2023

那在输入分词器前,把描述和标题中UP主的名字用引号引起来,会不会让分词器更好地识别UP主的名字

简单验证了一下,中英文引号,空格,均无法动摇分词器的结果 -_-||

@F-park
Copy link
Contributor

F-park commented Dec 30, 2023

解决方案

Tip

up 主的名字可以通过 UserProfileCard 里的 this.data["name"] 获取到

在分词前把 up 主的名字剪切出来,直接按权重加到 wordMap 里。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants