计算机工程与设计
計算機工程與設計
계산궤공정여설계
COMPUTER ENGINEERING AND DESIGN
2015年
6期
1647-1651
,共5页
苏祥坤%吾守尔?斯拉木%买买提依明?哈斯木
囌祥坤%吾守爾?斯拉木%買買提依明?哈斯木
소상곤%오수이?사랍목%매매제의명?합사목
权重%词序%关键词%单文本%词语组合
權重%詞序%關鍵詞%單文本%詞語組閤
권중%사서%관건사%단문본%사어조합
weight%word order%keywords%single text%word combinations
为进一步改善关键词提取的效果,提出一种基于词序统计组合的关键词提取方法。通过词序统计、词性标注、停用词过滤、词语组合等步骤,实现短语或组合词的生成和候选关键词的过滤;通过其它特征项的引入,进一步提高最终提取关键词的准确度。实验结果表明,该方法对中文文本的关键词提取具有良好的效果。
為進一步改善關鍵詞提取的效果,提齣一種基于詞序統計組閤的關鍵詞提取方法。通過詞序統計、詞性標註、停用詞過濾、詞語組閤等步驟,實現短語或組閤詞的生成和候選關鍵詞的過濾;通過其它特徵項的引入,進一步提高最終提取關鍵詞的準確度。實驗結果錶明,該方法對中文文本的關鍵詞提取具有良好的效果。
위진일보개선관건사제취적효과,제출일충기우사서통계조합적관건사제취방법。통과사서통계、사성표주、정용사과려、사어조합등보취,실현단어혹조합사적생성화후선관건사적과려;통과기타특정항적인입,진일보제고최종제취관건사적준학도。실험결과표명,해방법대중문문본적관건사제취구유량호적효과。
To improve the effect of the keyword extraction ,a method based on the combination of the word order was proposed . Through steps including the statistic of word order ,the POS tagging , the filtering of the stop words , words combination ,the phrase or the combination of the word was constructed ,and the candidate of keyword was filtered .On the other hand ,the accu‐racy of the final keyword extraction was improved greatly by the introduction of the other features .The experimental results show that the method has a great contribution to the Chinese text keyword extraction .