Computer and Digital Engineering (计算机与数字工程)
2015
Issue 10
pp. 1829-1833
5 pages
event element%Chinese emergency corpus (CEC)%undirected graph%weight%automatic summarization
Traditional automatic summarization tends to produce redundant information and incomplete content coverage, and the current mainstream techniques are mainly word-oriented. This paper studies how effective event elements, taken at the event level of knowledge granularity, are in addressing this problem. First, event elements are obtained from the annotated CEC corpus. Then an undirected graph of event elements is built, and weights are calculated for its nodes and undirected edges. Finally, concise summary sentences are selected and output in their original order in the text. Experiments are conducted mainly on the CEC corpus. Compared with other methods, the approach achieves better recall and precision, with an average F-value of up to 0.62, and it summarizes the text content better.
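The pipeline described in the abstract (event elements, undirected graph, node and edge weights, sentence selection, output in original order) can be illustrated with a minimal sketch. The abstract does not give the paper's weighting formulas, so everything concrete here is an assumption made for illustration: node weight is taken to be the number of sentences containing an element, edge weight the number of sentences in which two elements co-occur, and a sentence's score the sum of both over its elements. The function name `summarize`, the parameter `k`, and the sample data are hypothetical, and the event elements are assumed to be already extracted from the annotated corpus.

```python
from collections import defaultdict
from itertools import combinations


def summarize(sentences, k=2):
    """Pick k sentences by event-element graph weight (illustrative only).

    sentences: list of sets of event-element strings, one set per sentence,
    given in document order. Returns the chosen indices in document order.
    """
    node_w = defaultdict(int)  # assumed node weight: sentence frequency
    edge_w = defaultdict(int)  # assumed edge weight: co-occurrence count
    for elems in sentences:
        for e in elems:
            node_w[e] += 1
        # Sorting gives each undirected edge a single canonical key.
        for pair in combinations(sorted(elems), 2):
            edge_w[pair] += 1

    def score(elems):
        nodes = sum(node_w[e] for e in elems)
        edges = sum(edge_w[p] for p in combinations(sorted(elems), 2))
        return nodes + edges

    # Rank by descending score, breaking ties by position in the text.
    ranked = sorted(range(len(sentences)),
                    key=lambda i: (-score(sentences[i]), i))
    # Restore the original text order for the selected sentences.
    return sorted(ranked[:k])


# Hypothetical event elements for four sentences of one news report.
sentences = [
    {"earthquake", "Sichuan", "Monday"},
    {"rescuers", "Sichuan"},
    {"weather"},
    {"earthquake", "rescuers", "Sichuan"},
]
print(summarize(sentences, k=2))  # [0, 3]
```

This scoring favors longer sentences and does nothing explicit about redundancy between the selected sentences, both of which the paper's own weighting presumably handles; the sketch is meant only to show the shape of the computation.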