解放军理工大学学报(自然科学版)
解放軍理工大學學報(自然科學版)
해방군리공대학학보(자연과학판)
Journal of PLA University of Science and Technology (Natural Science Edition)
2015年
5期
407-412
,共6页
张立伟%张雄伟%胡永刚%闵刚%李轶南
張立偉%張雄偉%鬍永剛%閔剛%李軼南
장립위%장웅위%호영강%민강%리질남
语音增强%贝叶斯非负矩阵%多元Laplace分布
語音增彊%貝葉斯非負矩陣%多元Laplace分佈
어음증강%패협사비부구진%다원Laplace분포
speech enhancement%Bayesian nonnegative matrix%multivariate Laplace distribution
为了进一步提高增强语音的质量,基于传统的贝叶斯非负矩阵分解语音增强算法,考虑语音帧内原子间的相关性,提出了一种新的改进贝叶斯非负矩阵分解语音增强算法。该算法可分为训练和增强2个阶段:训练阶段利用该算法分别对纯净语音和噪声进行训练,得到纯净语音和噪声字典;增强阶段利用训练得到的纯净语音和噪声字典组成的联合字典结合,计算带噪语音时变增益,并利用最小均方误差估计得到增强语音频谱,进而重构增强语音。实验结果表明,该算法的对数频谱距离值和主观语音质量评估打分均优于非负矩阵分解(NMF)和贝叶斯非负矩阵分解(BNMF)等传统的语音增强算法,特别是在低信噪比条件下,该算法增强的效果更佳。
為瞭進一步提高增彊語音的質量,基于傳統的貝葉斯非負矩陣分解語音增彊算法,攷慮語音幀內原子間的相關性,提齣瞭一種新的改進貝葉斯非負矩陣分解語音增彊算法。該算法可分為訓練和增彊2箇階段:訓練階段利用該算法分彆對純淨語音和譟聲進行訓練,得到純淨語音和譟聲字典;增彊階段利用訓練得到的純淨語音和譟聲字典組成的聯閤字典結閤,計算帶譟語音時變增益,併利用最小均方誤差估計得到增彊語音頻譜,進而重構增彊語音。實驗結果錶明,該算法的對數頻譜距離值和主觀語音質量評估打分均優于非負矩陣分解(NMF)和貝葉斯非負矩陣分解(BNMF)等傳統的語音增彊算法,特彆是在低信譟比條件下,該算法增彊的效果更佳。
위료진일보제고증강어음적질량,기우전통적패협사비부구진분해어음증강산법,고필어음정내원자간적상관성,제출료일충신적개진패협사비부구진분해어음증강산법。해산법가분위훈련화증강2개계단:훈련계단이용해산법분별대순정어음화조성진행훈련,득도순정어음화조성자전;증강계단이용훈련득도적순정어음화조성자전조성적연합자전결합,계산대조어음시변증익,병이용최소균방오차고계득도증강어음빈보,진이중구증강어음。실험결과표명,해산법적대수빈보거리치화주관어음질량평고타분균우우비부구진분해(NMF)화패협사비부구진분해(BNMF)등전통적어음증강산법,특별시재저신조비조건하,해산법증강적효과경가。
To improve the quality of the enhanced speech,a novel improved speech enhancement algorithm via traditional Bayesian nonnegative matrix factorization (BNMF)was proposed.The correlations of atoms in speech frame were considered in the improved BNMF (IBNMF).The proposed IBNMF consists of a training stage and an enhancing stage.During the training stage,the training sets of speech and noise were analyzed by IBNMF algorithm,and the dictionaries of speech and noise constructed.In the enhancing stage,combining the dictionaries of speech and noise,the coding matrix of speech was evaluated from the spectrum of noisy speech.Then,the spectrum of enhanced speech was derived by minimum mean square error (MMSE)estimator and the enhanced speech reconstructed.Experimental results show that the scores of log spectral distance(LSD)and perceptual evaluation of speech quality(PESQ)via IBNMF are better than the traditional speech enhancement methods,such as NMF and BNMF,especially in low SNR conditions.