计算机工程与科学
計算機工程與科學
계산궤공정여과학
COMPUTER ENGINEERING & SCIENCE
2009年
11期
33-36
,共4页
雷元武%窦勇%郭松%李鑫%雷国庆
雷元武%竇勇%郭鬆%李鑫%雷國慶
뢰원무%두용%곽송%리흠%뢰국경
舍入误差%LU分解%高精度乘累加
捨入誤差%LU分解%高精度乘纍加
사입오차%LU분해%고정도승루가
rounding error%LU decomposition%high-precision multiply and accumulate
本文首先分析LU分解中舍入误差的积累过程,建立精度损失与矩阵规模的关系模型来预测大规模LU分解的精度;然后,根据定点加法的简单、快速、无精度损失的特点,设计高精度乘累加器(HPMAcc),并基于此实现一个细粒度并行LU分解加速器.实验结果表明,和高精度软件库QD或MPFR相比,4PE结构的LU分解加速器能够取得100倍的加速比,同时取得90多位的计算精度.
本文首先分析LU分解中捨入誤差的積纍過程,建立精度損失與矩陣規模的關繫模型來預測大規模LU分解的精度;然後,根據定點加法的簡單、快速、無精度損失的特點,設計高精度乘纍加器(HPMAcc),併基于此實現一箇細粒度併行LU分解加速器.實驗結果錶明,和高精度軟件庫QD或MPFR相比,4PE結構的LU分解加速器能夠取得100倍的加速比,同時取得90多位的計算精度.
본문수선분석LU분해중사입오차적적루과정,건립정도손실여구진규모적관계모형래예측대규모LU분해적정도;연후,근거정점가법적간단、쾌속、무정도손실적특점,설계고정도승루가기(HPMAcc),병기우차실현일개세립도병행LU분해가속기.실험결과표명,화고정도연건고QD혹MPFR상비,4PE결구적LU분해가속기능구취득100배적가속비,동시취득90다위적계산정도.
In this paper we analyze the course of rounding error accumulation in the LU decomposition, and create a model, between the loss accuracy of the result and the scale of matrix, to predict the accuracy of large scale LU decomposi-tions. Then, we design a high-precision multiplying-accumulating (HPMAcc) unit in terms of the features of the simple, fast and error-free fixed-point add, and a fine-grain parallel LU decomposition accelerator based on this multiplying-accumu-lating unit Compared to the implementation of a high-precision software library such as QD or MPFR, the speed-up factors up to more than 100 are obtained. Meanwhile, more than 90 bits of accuracy can be achieved.