OpenI / jiqixuexiyushenduxuexi
Change the maximum number of characters in a candidate word to 7
master
wangsheng, 3 years ago
parent 612681d0f8
commit 5a6b882b53
1 changed file with 1 addition and 1 deletion
自然语言处理/短语挖掘与新词发现/苏剑林/main_sujianlin.py
@@ -17,7 +17,7 @@ myre = {2:'(..)', 3:'(...)', 4:'(....)', 5:'(.....)', 6:'(......)', 7:'(.......)
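 min_count = 10 # minimum number of occurrences for a word to be accepted
 min_support = 30 # minimum support for a word to be accepted; 1 corresponds to a random combination
 min_s = 3 # minimum information entropy for a word to be accepted; the larger it is, the more likely the string stands alone as a word
-max_sep = 4 # maximum number of characters in a candidate word
+max_sep = 7 # maximum number of characters in a candidate word
 t=[] # holds the results
 t.append(pd.Series(list(s)).value_counts()) # per-character counts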
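To illustrate what raising max_sep from 4 to 7 changes, here is a minimal sketch of candidate counting. It is not the code in main_sujianlin.py: the helper name count_candidates and the sample string are hypothetical, and plain slicing with collections.Counter stands in for the script's regex table and pandas counting. It assumes max_sep is an inclusive upper bound on candidate length.

```python
from collections import Counter


def count_candidates(text, max_sep):
    """Count every substring of length 1..max_sep (hypothetical helper)."""
    counts = Counter()
    for n in range(1, max_sep + 1):
        # Slide a window of width n across the text, one character at a time.
        for i in range(len(text) - n + 1):
            counts[text[i:i + n]] += 1
    return counts


sample = "abcabcabc"  # stand-in corpus, not from the repository
small = count_candidates(sample, 4)  # old setting
large = count_candidates(sample, 7)  # new setting

# The longest candidate considered grows from 4 to 7 characters.
print(max(len(k) for k in small), max(len(k) for k in large))
```

With the larger bound, longer phrases become eligible as candidates, at the cost of a larger candidate table to filter with min_count, min_support and min_s.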