计算机科学
計算機科學
계산궤과학
COMPUTER SCIENCE
2010年
5期
197-202,290
,共7页
自然语言处理%语义分析%修饰关系%知识库
自然語言處理%語義分析%脩飾關繫%知識庫
자연어언처리%어의분석%수식관계%지식고
NLP%Syntax%Semantic%Modifying relations%Knowledge base
自然语言语义分析是自然语言处理技术走向深层应用的瓶颈.当前在概念、关系层次上的语义分析方法主要有两种:基于统计的特征向量抽取方法和基于语义词典(WordNet、HowNet等)的语义相似度计算方法.对于具体应用这两种方法都具有较大不足,前者由于统计模型的关系只适用于段落、篇章或多文档等粗粒度的语义分析,而不适合在句子词汇一级的应用;后者能方便处理实体概念之间的各种关系,但是如果想正确处理真实文本中的复杂修饰关系如概念与事件、概念与概念修饰、事件与事件修饰等关系,还需对语义词典和计算方法做进一步的扩展.提出了按照真实文本语句中词语之间修饰关系建立知识库,并设计了根据该知识库中已有修饰关系计算未知关系的算法;提出了可以依照修饰关系建立自然语言构句法的思路并给出了相关算法;最后给出了在语义分析系统上的实验,结果证明该方法是有效的.
自然語言語義分析是自然語言處理技術走嚮深層應用的瓶頸.噹前在概唸、關繫層次上的語義分析方法主要有兩種:基于統計的特徵嚮量抽取方法和基于語義詞典(WordNet、HowNet等)的語義相似度計算方法.對于具體應用這兩種方法都具有較大不足,前者由于統計模型的關繫隻適用于段落、篇章或多文檔等粗粒度的語義分析,而不適閤在句子詞彙一級的應用;後者能方便處理實體概唸之間的各種關繫,但是如果想正確處理真實文本中的複雜脩飾關繫如概唸與事件、概唸與概唸脩飾、事件與事件脩飾等關繫,還需對語義詞典和計算方法做進一步的擴展.提齣瞭按照真實文本語句中詞語之間脩飾關繫建立知識庫,併設計瞭根據該知識庫中已有脩飾關繫計算未知關繫的算法;提齣瞭可以依照脩飾關繫建立自然語言構句法的思路併給齣瞭相關算法;最後給齣瞭在語義分析繫統上的實驗,結果證明該方法是有效的.
자연어언어의분석시자연어언처리기술주향심층응용적병경.당전재개념、관계층차상적어의분석방법주요유량충:기우통계적특정향량추취방법화기우어의사전(WordNet、HowNet등)적어의상사도계산방법.대우구체응용저량충방법도구유교대불족,전자유우통계모형적관계지괄용우단락、편장혹다문당등조립도적어의분석,이불괄합재구자사회일급적응용;후자능방편처리실체개념지간적각충관계,단시여과상정학처리진실문본중적복잡수식관계여개념여사건、개념여개념수식、사건여사건수식등관계,환수대어의사전화계산방법주진일보적확전.제출료안조진실문본어구중사어지간수식관계건립지식고,병설계료근거해지식고중이유수식관계계산미지관계적산법;제출료가이의조수식관계건립자연어언구구법적사로병급출료상관산법;최후급출료재어의분석계통상적실험,결과증명해방법시유효적.
Acquiring the meaning of natural language is a bottleneck to make deeper use of natural language processing (NLP).There are two main measures on analyzing meaning of natural language at conception-relation level:one is the method of extracting characteristic vectors based on statistics,and the other one is method of computing semantic similarities according to semantic dictionary like WordNet or HowNet.Both of the two methods have weakness when putring them to applications.The previous is only applicable to analyze the meaning of those materials with big granularities such as paragraphs,documents or multi-documents,but is not fit for the applications at the level of sentences or words.The latter can deal all sorts of relations between conceptions easily,but when coming to complicated modified relations between conceptions and events,conceptions and conceptions or events and events,the semantic dictionary and computing method shall be extended.This paper presented a new method to structure semantic knowledge base(SKB)according to the modifying relation of real context;algorithm of computing unknown relations on the knowledge base was presented;we pointed out the way to design the rules of constructing natural language sentences under modifying relation and present the algorithm;in the end we made experiment on the platform developed in the light of the theory mentioned above and the result shows the theory is feasible.