计算机系统应用
APPLICATIONS OF THE COMPUTER SYSTEMS
2010
Issue 1
pp. 171-175
5 pages
苏波%李凯%徐志广%何颂颂
ATLAS%KD-50-I%cache失效%非阻塞cache
ATLAS%KD-50-I%cache miss%non-blocking cache
一般的数据处理程序中,计算时间在其中往往只起次要作用,因此访存方式是否有效对程序的性能影响很大.在基于龙芯2F处理器研制的高性能计算机系统KD-50-I上安装ATLAS,经测试其性能只达到龙芯2F理论峰值的30%.通过循环展开减少函数存储访问次数,增大计算访存比;采用数据分块、部分拷贝以增强访存局部性,减少cache失效;利用非阻塞cache加快内存访问速度等访存优化技术,将ATLAS性能提高50%以上.
In most data-processing programs, computation accounts for only a minor share of the running time, so the efficiency of memory access has a large effect on program performance. ATLAS was installed on KD-50-I, a high-performance computer system built on the Loongson 2F processor, and tests showed that it reached only 30% of the Loongson 2F's theoretical peak. This paper applies three memory access optimizations: loop unrolling, to reduce the number of memory accesses and raise the ratio of computation to memory access; data blocking and partial copying, to improve temporal and spatial locality and reduce cache misses; and the non-blocking cache, to pipeline memory accesses and speed them up. Together these techniques improve ATLAS performance by more than 50%.
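The abstract names loop unrolling and data blocking without showing either. The sketch below is a minimal illustration of how the two usually combine in a matrix multiply; it is not the authors' ATLAS kernel. The function names, the row-major layout, and the tile size `TILE` are assumptions made for illustration, and partial copying and non-blocking cache use are not shown.

```c
#include <stddef.h>

enum { TILE = 32 };  /* hypothetical tile size; not a value from the paper */

static size_t min_sz(size_t a, size_t b) { return a < b ? a : b; }

/* Reference triple loop: C += A * B, all n x n, row-major. */
void matmul_naive(size_t n, const double *A, const double *B, double *C)
{
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++) {
            double sum = C[i * n + j];
            for (size_t k = 0; k < n; k++)
                sum += A[i * n + k] * B[k * n + j];
            C[i * n + j] = sum;
        }
}

/* Scalar update of one C element over one k-tile; used on tile edges. */
static void edge_update(size_t n, const double *A, const double *B, double *C,
                        size_t i, size_t j, size_t k0, size_t k1)
{
    double sum = C[i * n + j];
    for (size_t k = k0; k < k1; k++)
        sum += A[i * n + k] * B[k * n + j];
    C[i * n + j] = sum;
}

/* Blocked C += A * B. Inside each tile the i and j loops are unrolled by 2,
 * so four C values stay in local variables and every loaded A or B element
 * feeds two multiply-adds instead of one. */
void matmul_tiled(size_t n, const double *A, const double *B, double *C)
{
    for (size_t ii = 0; ii < n; ii += TILE) {
        size_t i_end = min_sz(ii + TILE, n);
        for (size_t kk = 0; kk < n; kk += TILE) {
            size_t k_end = min_sz(kk + TILE, n);
            for (size_t jj = 0; jj < n; jj += TILE) {
                size_t j_end = min_sz(jj + TILE, n);
                size_t i = ii;
                for (; i + 2 <= i_end; i += 2) {
                    size_t j = jj;
                    for (; j + 2 <= j_end; j += 2) {
                        double c00 = C[i * n + j];
                        double c01 = C[i * n + j + 1];
                        double c10 = C[(i + 1) * n + j];
                        double c11 = C[(i + 1) * n + j + 1];
                        for (size_t k = kk; k < k_end; k++) {
                            double a0 = A[i * n + k];
                            double a1 = A[(i + 1) * n + k];
                            double b0 = B[k * n + j];
                            double b1 = B[k * n + j + 1];
                            c00 += a0 * b0;  c01 += a0 * b1;
                            c10 += a1 * b0;  c11 += a1 * b1;
                        }
                        C[i * n + j] = c00;
                        C[i * n + j + 1] = c01;
                        C[(i + 1) * n + j] = c10;
                        C[(i + 1) * n + j + 1] = c11;
                    }
                    /* Leftover column when the tile width is odd. */
                    for (; j < j_end; j++) {
                        edge_update(n, A, B, C, i, j, kk, k_end);
                        edge_update(n, A, B, C, i + 1, j, kk, k_end);
                    }
                }
                /* Leftover row when the tile height is odd. */
                for (; i < i_end; i++)
                    for (size_t j = jj; j < j_end; j++)
                        edge_update(n, A, B, C, i, j, kk, k_end);
            }
        }
    }
}
```

Both functions accumulate into `C`, so the caller initializes `C` first. In the 2x2 inner loop, four loads serve four multiply-adds, whereas the naive loop needs two loads per multiply-add; that is the higher compute-to-memory-access ratio the abstract refers to. A suitable `TILE` depends on the cache sizes of the target processor and would need to be tuned.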