计算机工程与设计
計算機工程與設計
계산궤공정여설계
COMPUTER ENGINEERING AND DESIGN
2015年
8期
2133-2137
,共5页
霍迎秋%王武星%彭楚风%方勇
霍迎鞦%王武星%彭楚風%方勇
곽영추%왕무성%팽초풍%방용
实对称矩阵%分块迭代求逆%图形处理单元%统一计算设备架构%并行算法
實對稱矩陣%分塊迭代求逆%圖形處理單元%統一計算設備架構%併行算法
실대칭구진%분괴질대구역%도형처리단원%통일계산설비가구%병행산법
real symmetric matrix%block iterative inverse%GPU%CUDA%parallel algorithm
针对大型实对称矩阵数值求逆算法运算量大、计算时间长的问题,分析分块迭代求逆算法的并行性,设计基于CUDA (compute unified device architecture)的并行求逆算法。设计对比分析实验,实验结果表明,该算法能够提高大型实对称矩阵求逆的速度,当矩阵大小为8000×8000时,加速比高达279倍,很好满足了实际工程中对实时性要求高的需求,且计算精度基本保持不变。
針對大型實對稱矩陣數值求逆算法運算量大、計算時間長的問題,分析分塊迭代求逆算法的併行性,設計基于CUDA (compute unified device architecture)的併行求逆算法。設計對比分析實驗,實驗結果錶明,該算法能夠提高大型實對稱矩陣求逆的速度,噹矩陣大小為8000×8000時,加速比高達279倍,很好滿足瞭實際工程中對實時性要求高的需求,且計算精度基本保持不變。
침대대형실대칭구진수치구역산법운산량대、계산시간장적문제,분석분괴질대구역산법적병행성,설계기우CUDA (compute unified device architecture)적병행구역산법。설계대비분석실험,실험결과표명,해산법능구제고대형실대칭구진구역적속도,당구진대소위8000×8000시,가속비고체279배,흔호만족료실제공정중대실시성요구고적수구,차계산정도기본보지불변。
Some problems including the huge amount of computations and time-consumption exit in the process of computing the inverse matrix of a large real symmetric matrix using block-based iterative inversion algorithm.To solve these problems,a paral-lel analysis of the iterative inversion algorithm was made,and then it was optimized based on CUDA.The comparison between the sequential algorithm and the parallel one was designed.The experimental result shows that the parallel algorithm is able to improve the speed of the algorithm greatly.When the size of matrix is 8000×8000,the speedup is 279 times.Hence this parallel algorithm can meet the high real-time requirement well in actual applications,while maintaining almost the same accuracy.