电脑知识与技术
電腦知識與技術
전뇌지식여기술
COMPUTER KNOWLEDGE AND TECHNOLOGY
2013年
33期
7590-7592
,共3页
艾山·吾买尔%早克热·卡德尔%买合木提·买买提%吐尔根·伊布拉音
艾山·吾買爾%早剋熱·卡德爾%買閤木提·買買提%吐爾根·伊佈拉音
애산·오매이%조극열·잡덕이%매합목제·매매제%토이근·이포랍음
维吾尔语%语言模型%二分查找%C#
維吾爾語%語言模型%二分查找%C#
유오이어%어언모형%이분사조%C#
Uyghur%Language model%Binary search%C#
由于SRILM语言模型计算工具包需要把全部模型数据载入内存,有些语言模型为几百兆或几千兆,对于个人计算机来说,每一次启动应用程序要载入几百兆的数据不仅耗内存,也很耗时,SRILM不适合于面向个人计算机的软件。该文中,为了解决面向个人用户的汉维机器翻译语言模型的计算问题,在SRILM基础上,实现了基于C#的即需即载数据的语言模型计算工具,数据用二分查找进行检索。
由于SRILM語言模型計算工具包需要把全部模型數據載入內存,有些語言模型為幾百兆或幾韆兆,對于箇人計算機來說,每一次啟動應用程序要載入幾百兆的數據不僅耗內存,也很耗時,SRILM不適閤于麵嚮箇人計算機的軟件。該文中,為瞭解決麵嚮箇人用戶的漢維機器翻譯語言模型的計算問題,在SRILM基礎上,實現瞭基于C#的即需即載數據的語言模型計算工具,數據用二分查找進行檢索。
유우SRILM어언모형계산공구포수요파전부모형수거재입내존,유사어언모형위궤백조혹궤천조,대우개인계산궤래설,매일차계동응용정서요재입궤백조적수거불부모내존,야흔모시,SRILM불괄합우면향개인계산궤적연건。해문중,위료해결면향개인용호적한유궤기번역어언모형적계산문제,재SRILM기출상,실현료기우C#적즉수즉재수거적어언모형계산공구,수거용이분사조진행검색。
Because the language model computing toolkits of SRILM require to put full model data loading at the initializing step, and some language models data were more than hundreds of megabytes or even more than thousands of megabytes, for the personal computer, loading hundreds of megabytes model data at the loading of application not only was memory consuming, but also very time consuming. SRILM is not suitable for personal computer software. In this paper, in order to solve the problem of individual user oriented Chinese Uyghur machine translation language model, on the basis of SRILM, implemented On-demand loading language model computing toolkit using C#, data is retrieved using binary search.