山东大学学报(理学版)
山東大學學報(理學版)
산동대학학보(이학판)
JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE)
2014年
1期
76-79
,共4页
潘清清%周枫%余正涛%郭剑毅%线岩团
潘清清%週楓%餘正濤%郭劍毅%線巖糰
반청청%주풍%여정도%곽검의%선암단
越南语%命名实体识别%条件随机场%机器学习
越南語%命名實體識彆%條件隨機場%機器學習
월남어%명명실체식별%조건수궤장%궤기학습
Vietnamese named entity recognition%feature selection%conditional random fields%machine learning
针对越南语特点,提出一种基于条件随机场模型的越语命名实体识别方法。该方法针对越语词和词性的特点,采用条件随机场算法,选取词和词性作为特征,定义特征模版,选取越南语新闻文本,标记地名、人名、组织机构等6类实体语料,训练获得越南语实体识别模型,实现实体识别。实验结果表明该方法提取实体的准确率达到83.73%。
針對越南語特點,提齣一種基于條件隨機場模型的越語命名實體識彆方法。該方法針對越語詞和詞性的特點,採用條件隨機場算法,選取詞和詞性作為特徵,定義特徵模版,選取越南語新聞文本,標記地名、人名、組織機構等6類實體語料,訓練穫得越南語實體識彆模型,實現實體識彆。實驗結果錶明該方法提取實體的準確率達到83.73%。
침대월남어특점,제출일충기우조건수궤장모형적월어명명실체식별방법。해방법침대월어사화사성적특점,채용조건수궤장산법,선취사화사성작위특정,정의특정모판,선취월남어신문문본,표기지명、인명、조직궤구등6류실체어료,훈련획득월남어실체식별모형,실현실체식별。실험결과표명해방법제취실체적준학솔체도83.73%。
A method of named entity recognition is proposed based on conditional random fields model aimed at the lan-guage feature of Vietnamese.This method aims at the feature of word and part of speech, adopts the arithmetic of con-ditional random fields, selects the word and part of speech as the feature, defines the feature template, chooses the news text of Vietnamese, tags the six entity linguistic data such as place name, person name and organization, trains the Viet-namese entity recognition model which acquired.Vietnamese entity recognition experiment results prove that the entity recognition accuracy rate of this method reach 83.73%.