宁波大学学报(理工版)
寧波大學學報(理工版)
저파대학학보(리공판)
JOURNAL OF NINGBO UNIVERSITY(NSEE)
2013年
4期
40-44
,共5页
同义词词林%文本关系图%段落相似度%主题划分%归一化割
同義詞詞林%文本關繫圖%段落相似度%主題劃分%歸一化割
동의사사림%문본관계도%단락상사도%주제화분%귀일화할
Tongyici Cilin%text relation map%paragraph similarity%topic partition%Normalized Cut
为了保证抽取信息的全面性,主题划分成了不可或缺的工作。借助同义词词林,从词语的语义角度计算文本中各个段落间的相似度,建立段落文本关系图。基于文本关系图对归一化割分割准则中权值矩阵的构建做出调整,使之更能体现出段落间的相似程度,并使用该准则对文本进行主题划分。结果表明,该方法无论是对连续段落还是跨段落表达同一主题的主题划分均较为有效。
為瞭保證抽取信息的全麵性,主題劃分成瞭不可或缺的工作。藉助同義詞詞林,從詞語的語義角度計算文本中各箇段落間的相似度,建立段落文本關繫圖。基于文本關繫圖對歸一化割分割準則中權值矩陣的構建做齣調整,使之更能體現齣段落間的相似程度,併使用該準則對文本進行主題劃分。結果錶明,該方法無論是對連續段落還是跨段落錶達同一主題的主題劃分均較為有效。
위료보증추취신식적전면성,주제화분성료불가혹결적공작。차조동의사사림,종사어적어의각도계산문본중각개단락간적상사도,건립단락문본관계도。기우문본관계도대귀일화할분할준칙중권치구진적구건주출조정,사지경능체현출단락간적상사정도,병사용해준칙대문본진행주제화분。결과표명,해방법무론시대련속단락환시과단락표체동일주제적주제화분균교위유효。
To ensure the completeness of information extraction, the topic partition is one of the indispensable tasks. With the aid of Tongyici Cilin, we first seek the similarity between paragraphs from the point of semantic computing, based on which we then establish text relation map. Using and accordingly adjusting the weight matrix, the degree of similarity between paragraphs can be more accurately obtained, in which the Normalized Cut approach is adopted to complete the topic partition of text. The experimental results show that the method is effective either for consecutive paragraphs or for cross-paragraphs expressing a similar topic.