计算机工程与应用
計算機工程與應用
계산궤공정여응용
COMPUTER ENGINEERING AND APPLICATIONS
2014年
14期
135-138,254
,共5页
张建周%哈力木拉提·买买提%陈晓娇
張建週%哈力木拉提·買買提%陳曉嬌
장건주%합력목랍제·매매제%진효교
维吾尔文文字识别%连体段%聚类算法%等间距法%有效相似比%正确率
維吾爾文文字識彆%連體段%聚類算法%等間距法%有效相似比%正確率
유오이문문자식별%련체단%취류산법%등간거법%유효상사비%정학솔
Uyghur character recognition%word-part%clustering algorithm%equal interval method%effective similarity ratio%accuracy
在维吾尔文文字识别中,能否有效地聚类将直接影响识别结果的好坏。为改善聚类效果,针对维吾尔文连体段聚类,提出了一种改进的K-means聚类算法。该算法首先采用等间距法多次选择类中心,然后选择最佳码本和利用有效相似比来动态调整聚类个数K,最后完成了连体段聚类。实验结果表明:与传统K-means算法相比,改进的K-means算法得到了较好聚类效果,聚类正确率达90%以上。
在維吾爾文文字識彆中,能否有效地聚類將直接影響識彆結果的好壞。為改善聚類效果,針對維吾爾文連體段聚類,提齣瞭一種改進的K-means聚類算法。該算法首先採用等間距法多次選擇類中心,然後選擇最佳碼本和利用有效相似比來動態調整聚類箇數K,最後完成瞭連體段聚類。實驗結果錶明:與傳統K-means算法相比,改進的K-means算法得到瞭較好聚類效果,聚類正確率達90%以上。
재유오이문문자식별중,능부유효지취류장직접영향식별결과적호배。위개선취류효과,침대유오이문련체단취류,제출료일충개진적K-means취류산법。해산법수선채용등간거법다차선택류중심,연후선택최가마본화이용유효상사비래동태조정취류개수K,최후완성료련체단취류。실험결과표명:여전통K-means산법상비,개진적K-means산법득도료교호취류효과,취류정학솔체90%이상。
In Uyghur character recognition, the effect of the cluster will affect the recognition rate directly. To improve the clustering result, an improved K-means clustering algorithm based on Uyghur word-part is presented. The first step of the method is to select the center of the clustering by using the equal interval method repeatedly in order to select the best codebook, then adjust the number of clustering classes(noted as K)by using an effective similarity ratio dynamically. Finally, the word-part clustering is completed. The experimental results show that:compared with the traditional K-means algorithm, the improved K-means algorithm gets a better result and the clustering accuracy is more than 90%.