计算机技术与发展
計算機技術與髮展
계산궤기술여발전
Computer Technology and Development
2015年
9期
43-47
,共5页
哈萨克语%句法分析%线图分析算法%规则库%句法树
哈薩剋語%句法分析%線圖分析算法%規則庫%句法樹
합살극어%구법분석%선도분석산법%규칙고%구법수
Kazakh%syntactic analysis%chart analysis algorithm%rule base%syntax tree
哈萨克语的理解一般分为以下步骤:原文输入、词语切分及词语属性特征标注、语法及句法分析、语义及语用和语境分析、生成目标形式表示、句群及篇章理解等。句子分析上接篇章理解,下联词汇分析,起着承上启下的作用。由于哈萨克语句法分析结果的准确度将对后续机器翻译的研究产生影响,在掌握哈萨克语词法分析技术的基础上,结合现代哈萨克语句法结构特点,首先介绍了厄尔利算法、GLR算法和线图算法三种基于规则的句法分析算法。通过实验对比发现,线图分析算法在哈萨克语简单句的分析中具有运算速度快和占用空间小的综合优势。针对传统线图分析算法冗余边较多造成分析准确率不高的现象引入规则库优化的改进线图算法,实验结果表明,改进后的线图算法使得准确率提高了4.19%,运行时间缩短了20倍。
哈薩剋語的理解一般分為以下步驟:原文輸入、詞語切分及詞語屬性特徵標註、語法及句法分析、語義及語用和語境分析、生成目標形式錶示、句群及篇章理解等。句子分析上接篇章理解,下聯詞彙分析,起著承上啟下的作用。由于哈薩剋語句法分析結果的準確度將對後續機器翻譯的研究產生影響,在掌握哈薩剋語詞法分析技術的基礎上,結閤現代哈薩剋語句法結構特點,首先介紹瞭阨爾利算法、GLR算法和線圖算法三種基于規則的句法分析算法。通過實驗對比髮現,線圖分析算法在哈薩剋語簡單句的分析中具有運算速度快和佔用空間小的綜閤優勢。針對傳統線圖分析算法冗餘邊較多造成分析準確率不高的現象引入規則庫優化的改進線圖算法,實驗結果錶明,改進後的線圖算法使得準確率提高瞭4.19%,運行時間縮短瞭20倍。
합살극어적리해일반분위이하보취:원문수입、사어절분급사어속성특정표주、어법급구법분석、어의급어용화어경분석、생성목표형식표시、구군급편장리해등。구자분석상접편장리해,하련사회분석,기착승상계하적작용。유우합살극어구법분석결과적준학도장대후속궤기번역적연구산생영향,재장악합살극어사법분석기술적기출상,결합현대합살극어구법결구특점,수선개소료액이리산법、GLR산법화선도산법삼충기우규칙적구법분석산법。통과실험대비발현,선도분석산법재합살극어간단구적분석중구유운산속도쾌화점용공간소적종합우세。침대전통선도분석산법용여변교다조성분석준학솔불고적현상인입규칙고우화적개진선도산법,실험결과표명,개진후적선도산법사득준학솔제고료4.19%,운행시간축단료20배。
The understanding of the Kazakh is generally divided into the following steps,the original input words,word segmentation and attribute features labeling,grammar and syntax analysis,semantics and pragmatics,and context analysis,generating target form,sentence group and text understanding,etc. Sentence analysis discourses text understanding,allying lexical analysis,playing the essential role. Be-cause the Kazakh syntactic analysis result accuracy influences the followed machine translation,based on mastering Kazakh lexical analy-sis technology,combined with the characteristics of modern Kazakh syntactic structure,first introduce the three rule-based parsing algo-rithms including Earley algorithm,GLR algorithm and chart analysis algorithm. The chart analysis algorithm has fast speed and small foot-print of the comprehensive advantages in simple Kazakh sentences analysis found by experimental comparison. The rule base optimization chart analysis algorithm is introduced to aim at the problem of low accuracy caused by more side redundancy,experimental results show that the algorithm makes the accuracy improved 4. 19%,the running time shortens 20 times.