计算机工程
計算機工程
계산궤공정
COMPUTER ENGINEERING
2013年
7期
7-10,15
,共5页
张家杰%欧鹏%俞政%于学球%虞志益
張傢傑%歐鵬%俞政%于學毬%虞誌益
장가걸%구붕%유정%우학구%우지익
多核处理器%运算阵列%扩展寄存器%片上网络%快速傅里叶变换%H.264解码器
多覈處理器%運算陣列%擴展寄存器%片上網絡%快速傅裏葉變換%H.264解碼器
다핵처리기%운산진렬%확전기존기%편상망락%쾌속부리협변환%H.264해마기
multi-core processor%computing array%extended register%Network-on-Chip(NoC)%Fast Fourier Transform(FFT)%H.264 decoder
为提高多核处理器性能,在传统硬件加速部件的基础上,提出一种新型的运算阵列设计方案。将运算阵列与多核处理器的通信端口映射在扩展寄存器地址空间上,实现阵列与多核处理器的紧密耦合。通过片上网络连接各个运算单元,实现运算阵列的灵活配置和高度共享。在实验系统上实现1024点快速傅里叶变换和H.264解码器,结果表明,与纯软件实现相比,该方案能使处理器性能和功耗都有所改善。
為提高多覈處理器性能,在傳統硬件加速部件的基礎上,提齣一種新型的運算陣列設計方案。將運算陣列與多覈處理器的通信耑口映射在擴展寄存器地阯空間上,實現陣列與多覈處理器的緊密耦閤。通過片上網絡連接各箇運算單元,實現運算陣列的靈活配置和高度共享。在實驗繫統上實現1024點快速傅裏葉變換和H.264解碼器,結果錶明,與純軟件實現相比,該方案能使處理器性能和功耗都有所改善。
위제고다핵처리기성능,재전통경건가속부건적기출상,제출일충신형적운산진렬설계방안。장운산진렬여다핵처리기적통신단구영사재확전기존기지지공간상,실현진렬여다핵처리기적긴밀우합。통과편상망락련접각개운산단원,실현운산진렬적령활배치화고도공향。재실험계통상실현1024점쾌속부리협변환화H.264해마기,결과표명,여순연건실현상비,해방안능사처리기성능화공모도유소개선。
In order to improve the performance of multi-core processor, based on traditional hardware accelerator, this paper presents a novel computing array design scheme. The communication ports between the computing array and the processor are mapped in the address space of extended register file, which makes the computing array and the processor tightly coupled. The computing units are connected by Network-on-Chip(NoC) which enables the computing array be flexibly configured and highly shared by the multi-core processor. A 1 024-point Fast Fourier Transform(FFT) and an H.264 decoder are implemented on the experimental platform, and results show that the scheme can improve the performance and power consumption significantly compared to pure software solution.