西安电子科技大学学报(自然科学版)
西安電子科技大學學報(自然科學版)
서안전자과기대학학보(자연과학판)
JOURNAL OF XIDIAN UNIVERSITY(NATURAL SCIENCE)
2015年
3期
135-140,191
,共7页
信号处理%并行计算%图形处理器%程序优化%连续小波变换
信號處理%併行計算%圖形處理器%程序優化%連續小波變換
신호처리%병행계산%도형처리기%정서우화%련속소파변환
signal processing%parallel computing%graphics processing unit (GPU)%program optimization%continuous wavelet transform (CWT)
从宽带相关的角度推导了基于小波变换的匹配滤波算法及基于快速傅里叶变换(FFT)算法,并分析了算法复杂度,提出了基于图形处理器(GPU)的可配置宽带匹配滤波的软件实现和理论预测与函数实测结合的优化方法.通过优化线程块的维度、绑定纹理寄存器来改进内核函数性能,再使用计算统一设备架构(CUDA)库来降低FFT与极值搜索的时延,并进行了性能优化设计.在性能测试中,文中方法在 GPU平台的实现相比8核CPU平台的实现具有3.3倍加速比,其处理时延能够满足宽带匹配滤波的实时性需求.
從寬帶相關的角度推導瞭基于小波變換的匹配濾波算法及基于快速傅裏葉變換(FFT)算法,併分析瞭算法複雜度,提齣瞭基于圖形處理器(GPU)的可配置寬帶匹配濾波的軟件實現和理論預測與函數實測結閤的優化方法.通過優化線程塊的維度、綁定紋理寄存器來改進內覈函數性能,再使用計算統一設備架構(CUDA)庫來降低FFT與極值搜索的時延,併進行瞭性能優化設計.在性能測試中,文中方法在 GPU平檯的實現相比8覈CPU平檯的實現具有3.3倍加速比,其處理時延能夠滿足寬帶匹配濾波的實時性需求.
종관대상관적각도추도료기우소파변환적필배려파산법급기우쾌속부리협변환(FFT)산법,병분석료산법복잡도,제출료기우도형처리기(GPU)적가배치관대필배려파적연건실현화이론예측여함수실측결합적우화방법.통과우화선정괴적유도、방정문리기존기래개진내핵함수성능,재사용계산통일설비가구(CUDA)고래강저FFT여겁치수색적시연,병진행료성능우화설계.재성능측시중,문중방법재 GPU평태적실현상비8핵CPU평태적실현구유3.3배가속비,기처리시연능구만족관대필배려파적실시성수구.
The fine estimation of wideband ambiguity,which has a sharp main ridge,requires large amounts of searching on the time-scale. That desperately needs the well-optimized software on high performance hardware.In terms of wideband correlation,the matched filter based on the CWT and its fast algorithm based on the FFT are studied,and furthermore its complexity is analyzed.Then a reconfigurable implementation on the GPU is proposed,and a method of optimization that combines analysis with testing is proposed.By optimizing the dimension of the thread block and utilizing texture memory,the time of the kernel is reduced;the CUDA library is introduced,so the delays of the FFT and maximum searching are reduced.In comparison with the method in the 8-core CPU,the proposed method improves the overall performance up to 3.3 times.The speed can meet the challenge of real-time processing of the wideband matched filter.