西安电子科技大学学报(自然科学版)
西安電子科技大學學報(自然科學版)
서안전자과기대학학보(자연과학판)
JOURNAL OF XIDIAN UNIVERSITY(NATURAL SCIENCE)
2015年
1期
194-199,212
,共7页
时域有限差分法%卷积完全匹配层%图形处理器%并行计算%计算统一设备架构
時域有限差分法%捲積完全匹配層%圖形處理器%併行計算%計算統一設備架構
시역유한차분법%권적완전필배층%도형처리기%병행계산%계산통일설비가구
finite difference time domain method%convolution perfectly matched layer%graphics processing unit%parallel computing%compute unified device architecture
针对并行CPML存在的计算冗余和访问冗余问题,提出了一种用于时域有限差分法的图形处理器加速无除法联合最小访存CPML更新方案。该方案通过重新安排 CPML 迭代公式,将除法操作吸收进公式的固定系数中,消去了图形处理器计算中负担繁重的除法操作。该方案进一步通过合并 P ML 区域内时域有限差分法常规场值更新步骤和CPML更新步骤,剔除了这两个步骤中的重复访存,使算法的访存需求最小化。数值验证结果表明,在同等精度下,CPML 更新过程和 PML 区域场值整体计算过程分别减少了70%和44%的计算时间。
針對併行CPML存在的計算冗餘和訪問冗餘問題,提齣瞭一種用于時域有限差分法的圖形處理器加速無除法聯閤最小訪存CPML更新方案。該方案通過重新安排 CPML 迭代公式,將除法操作吸收進公式的固定繫數中,消去瞭圖形處理器計算中負擔繁重的除法操作。該方案進一步通過閤併 P ML 區域內時域有限差分法常規場值更新步驟和CPML更新步驟,剔除瞭這兩箇步驟中的重複訪存,使算法的訪存需求最小化。數值驗證結果錶明,在同等精度下,CPML 更新過程和 PML 區域場值整體計算過程分彆減少瞭70%和44%的計算時間。
침대병행CPML존재적계산용여화방문용여문제,제출료일충용우시역유한차분법적도형처리기가속무제법연합최소방존CPML경신방안。해방안통과중신안배 CPML 질대공식,장제법조작흡수진공식적고정계수중,소거료도형처리기계산중부담번중적제법조작。해방안진일보통과합병 P ML 구역내시역유한차분법상규장치경신보취화CPML경신보취,척제료저량개보취중적중복방존,사산법적방존수구최소화。수치험증결과표명,재동등정도하,CPML 경신과정화 PML 구역장치정체계산과정분별감소료70%화44%적계산시간。
To overcome computational redundancy and memory-access redundancy of the traditional GPU-accelerated CPML technique,a novel division-free and minimum-access CPML scheme is proposed.In the proposed scheme,the division operators in the CPML method are merged into a series of fixed coefficients by optimally rearranging the iteration process of CPML and then,the reduplicate memory accesses are eliminated by updating the FDTD and CPML operation in the PML region jointly.Experimental results show that the proposed structure can save up to 70% operation time compared with the traditional GPU-CPML technique and 44% of field updating in the PML region,without any loss of accuracy.