计算机工程与设计
計算機工程與設計
계산궤공정여설계
COMPUTER ENGINEERING AND DESIGN
2014年
8期
2944-2948
,共5页
麦合甫热提%米日姑·肉孜%麦热哈巴·艾力%吐尔根·依布拉音
麥閤甫熱提%米日姑·肉孜%麥熱哈巴·艾力%吐爾根·依佈拉音
맥합보열제%미일고·육자%맥열합파·애력%토이근·의포랍음
自然语言处理%命名实体识别%机构名识别%知识库%规则匹配
自然語言處理%命名實體識彆%機構名識彆%知識庫%規則匹配
자연어언처리%명명실체식별%궤구명식별%지식고%규칙필배
natural language processing%named entity recognition%organization name recognition%knowledge base%rule matching
为了提高维吾尔语中机构名的自动识别准确率,从维吾尔语的语言特点出发,对维吾尔语中机构名的组织结构进行了分类并将其形式化表示;根据此特征设计出有效地识别规则,创建了特征词库、地名库和修饰词库等知识库;设计并实现了基于状态转移原理的高效识别算法。实验结果表明,该算法识别的F值达到83.05%,获得了较好结果。
為瞭提高維吾爾語中機構名的自動識彆準確率,從維吾爾語的語言特點齣髮,對維吾爾語中機構名的組織結構進行瞭分類併將其形式化錶示;根據此特徵設計齣有效地識彆規則,創建瞭特徵詞庫、地名庫和脩飾詞庫等知識庫;設計併實現瞭基于狀態轉移原理的高效識彆算法。實驗結果錶明,該算法識彆的F值達到83.05%,穫得瞭較好結果。
위료제고유오이어중궤구명적자동식별준학솔,종유오이어적어언특점출발,대유오이어중궤구명적조직결구진행료분류병장기형식화표시;근거차특정설계출유효지식별규칙,창건료특정사고、지명고화수식사고등지식고;설계병실현료기우상태전이원리적고효식별산법。실험결과표명,해산법식별적F치체도83.05%,획득료교호결과。
To improve the automatic recognation of organization name in Uyghur ,through analysis of the characterstics of Uy-ghur organization name ,the following work was done .First ,the organization name in Uyghur was classified depending on its structure and it was formally described .After then ,effective recognizing rules were desingned according to these features , knowledge base was created such as features word base ,place name base and qualifier word base .Finally ,efficient recognition algorithm was designed and implemented based on the principles of state transition .Representative examples from the Tianshan net news were selected to build the test set for organization name recognition ,experimental results showed that ,this method a-chieved better results with the F measure of 83.05% .