计算机科学与探索
計算機科學與探索
계산궤과학여탐색
JOURNAL OF FRONTIERS OF COMPUTER SCIENCE & TECHNOLOGY
2013年
4期
368-376
,共9页
仲兆满%李存华%戴红伟%刘宗田
仲兆滿%李存華%戴紅偉%劉宗田
중조만%리존화%대홍위%류종전
话题演化%子话题聚类%内容特征%时间特征
話題縯化%子話題聚類%內容特徵%時間特徵
화제연화%자화제취류%내용특정%시간특정
topic evolution%subtopic clustering%content features%time features
子话题是对话题的再次划分,是比话题粒度更细的新兴研究方向,子话题的聚类是话题内部演化关系分析的基础.提出了融合内容特征和时间特征的中文新闻子话题聚类方法,重点分析了子话题内容特征的表现规律,研究了子话题特征词的权重计算和降维方法.选取5个话题的18个子话题进行了实验,结果表明,所提方法的性能与已有的子话题聚类方法相比有显著提高.
子話題是對話題的再次劃分,是比話題粒度更細的新興研究方嚮,子話題的聚類是話題內部縯化關繫分析的基礎.提齣瞭融閤內容特徵和時間特徵的中文新聞子話題聚類方法,重點分析瞭子話題內容特徵的錶現規律,研究瞭子話題特徵詞的權重計算和降維方法.選取5箇話題的18箇子話題進行瞭實驗,結果錶明,所提方法的性能與已有的子話題聚類方法相比有顯著提高.
자화제시대화제적재차화분,시비화제립도경세적신흥연구방향,자화제적취류시화제내부연화관계분석적기출.제출료융합내용특정화시간특정적중문신문자화제취류방법,중점분석료자화제내용특정적표현규률,연구료자화제특정사적권중계산화강유방법.선취5개화제적18개자화제진행료실험,결과표명,소제방법적성능여이유적자화제취류방법상비유현저제고.
Subtopic is the division for the topic, and it is a new research direction compared with the topic. Subtopic clustering is the base for the analysis of topic evolution relations. This paper proposes a new method of clustering Chinese news subtopic integrating content and time features. It focuses on the analysis of subtopic content feature in text, and studies the computation of subtopic word weights and the dimension reduction of subtopic words. Five topics including 18 subtopics are used to conduct the experiment. Experimental results show that the performance of the proposed method is better than the existing subtopic identification methods.