Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG]按字频生成词频的功能存在优化的空间 #290

Open
lkyu-ly opened this issue Dec 23, 2023 · 0 comments
Open

[BUG]按字频生成词频的功能存在优化的空间 #290

lkyu-ly opened this issue Dec 23, 2023 · 0 comments

Comments

@lkyu-ly
Copy link

lkyu-ly commented Dec 23, 2023

若词典中出现生僻字,词频生成选择了软件自动生成词频的选项时,会出现:

开始生成词频...
给定关键字不在字典中,【 】

的报错,因为一两个生僻字导致整个词库不能导出,尤其是在词库条目达到百万级别时根本无法排查错字。

所以是否可以提供设置,选择把这部分词条的词频设置为固定值(即令他们不遵从词频生成);或者干脆添加过滤选项,直接滤去这部分词条

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants