计算机工程与设计
計算機工程與設計
계산궤공정여설계
COMPUTER ENGINEERING AND DESIGN
2014年
10期
3602-3607
,共6页
汪泱%古丽拉·阿东别克%户冰心%牛宁宁
汪泱%古麗拉·阿東彆剋%戶冰心%牛寧寧
왕앙%고려랍·아동별극%호빙심%우저저
基本短语识别%条件随机场%特征模板自动选择%哈萨克语%贪心策略
基本短語識彆%條件隨機場%特徵模闆自動選擇%哈薩剋語%貪心策略
기본단어식별%조건수궤장%특정모판자동선택%합살극어%탐심책략
base phrase identification%conditional random fields%automatic selection of feature template%Kazakh%greedy strategy
为解决识别哈萨克语基本短语的问题,提出一种基于条件随机场模型的哈萨克语基本短语自动识别方法。利用基于贪心策略的特征模板自动选择算法,结合哈萨克语基本短语的特点,从众多上下文特征中选取出合适的特征;每次从备选特征模板中挑选出局部最优的特征模板项,加入到最终的特征模板中,进一步提高识别准确率。实验结果表明,该方法的识别准确率和召回率分别达到了89.01%和84.07%。
為解決識彆哈薩剋語基本短語的問題,提齣一種基于條件隨機場模型的哈薩剋語基本短語自動識彆方法。利用基于貪心策略的特徵模闆自動選擇算法,結閤哈薩剋語基本短語的特點,從衆多上下文特徵中選取齣閤適的特徵;每次從備選特徵模闆中挑選齣跼部最優的特徵模闆項,加入到最終的特徵模闆中,進一步提高識彆準確率。實驗結果錶明,該方法的識彆準確率和召迴率分彆達到瞭89.01%和84.07%。
위해결식별합살극어기본단어적문제,제출일충기우조건수궤장모형적합살극어기본단어자동식별방법。이용기우탐심책략적특정모판자동선택산법,결합합살극어기본단어적특점,종음다상하문특정중선취출합괄적특정;매차종비선특정모판중도선출국부최우적특정모판항,가입도최종적특정모판중,진일보제고식별준학솔。실험결과표명,해방법적식별준학솔화소회솔분별체도료89.01%화84.07%。
To solve the problem of identifying Kazakh basic phrases ,an automatic identification method was presented based on conditional random fields .There are many features around the context in the process of identification ,and an automatic selection method of feature template based on the greedy algorithm was adopted to select features to combine with characters of Kazakh base phrases .In this algorithm ,relatively best feature items were added to the final feature template at each time ,and the recog-nition precision was improved .Experimental results show the recognition precision and the recall rate reach 89.01% and 84.07%respectively .