河南城建学院学报
河南城建學院學報
하남성건학원학보
JOURNAL OF PINGDINGSHAN INSTITUTE OF TECHNOLOGY
2012年
4期
48-50
,共3页
共现关系%特征扩展%短文本分类
共現關繫%特徵擴展%短文本分類
공현관계%특정확전%단문본분류
co-occurrence relationship%features extension%short text classification
针对短文本单一共现词特征扩展效果不理想的情况,提出一种改进的基于共现关系的短文本特征扩展算法,改进之处在于考虑了多个共现词同时出现的情况,改进了特征词权重计算公式及特征扩展策略,并应用于中文短文本分类,使分类准确度得到了一定提升。
針對短文本單一共現詞特徵擴展效果不理想的情況,提齣一種改進的基于共現關繫的短文本特徵擴展算法,改進之處在于攷慮瞭多箇共現詞同時齣現的情況,改進瞭特徵詞權重計算公式及特徵擴展策略,併應用于中文短文本分類,使分類準確度得到瞭一定提升。
침대단문본단일공현사특정확전효과불이상적정황,제출일충개진적기우공현관계적단문본특정확전산법,개진지처재우고필료다개공현사동시출현적정황,개진료특정사권중계산공식급특정확전책략,병응용우중문단문본분류,사분류준학도득도료일정제승。
In this paper, an improved expansion algorithm based on co-occurrence relationship between short text feature is proposed aimed at not ideal situation for a single co-occurrence word feature expansion. The im- provement is that we considered more than a total of the current words at the same time and improved features of the word weight calculation formula and characteristics of expansion strategy, and applied to the Chinese short-text classification, which has improved classification accuracy.