Journal of Southeast University (English Edition)
2009
Issue 2
pp. 152-155
4 pages
Wang Shikui (王仕奎)%Tang Yibin (汤一彬)%You Hongyan (尤红岩)%Wu Zhenyang (吴镇扬)
frame erasure concealment%wideband speech%large hidden Markov model%immittance spectral frequency (ISF) parameter
Frame erasure concealment is studied to address the rapid degradation of speech quality caused by the loss of speech parameters during transmission. A large hidden Markov model is used to model the immittance spectral frequency (ISF) parameters of the adaptive multi-rate wideband (AMR-WB) codec, and the lost ISFs are optimally estimated under the minimum mean square error (MMSE) criterion. The estimated ISFs are weighted with those of the previous frame to smooth the estimate, yielding the concealed ISF vectors, which replace the lost ISFs during speech synthesis at the receiver. Speech concealed by this algorithm is compared with speech concealed by the baseline method in Annex I of the G.722.2 specification. Simulation results show that the proposed algorithm performs better on both evaluation criteria: the signal-to-noise ratio (SNR) increases by about 2.41 dB and the frequency-weighted spectral distortion decreases by about 0.885 dB.
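The estimation and smoothing steps named in the abstract can be illustrated with a minimal sketch. The abstract gives no model size, training procedure, or weighting factor, so everything specific here is an assumption made for illustration: each HMM state is represented only by a mean ISF vector, the state posterior for the lost frame is obtained by one-step prediction through the transition matrix, and the smoothing weight defaults to 0.5. The function names are hypothetical and the toy vectors have 2 dimensions rather than a real ISF vector length. This is not the authors' implementation.

```python
def predict_state_posterior(prev_posterior, transition):
    """One-step HMM prediction (assumed form):
    p(s_t = j | past) = sum_i p(s_{t-1} = i | past) * A[i][j]."""
    n = len(transition)
    predicted = [
        sum(prev_posterior[i] * transition[i][j] for i in range(n))
        for j in range(n)
    ]
    total = sum(predicted)
    return [p / total for p in predicted]  # renormalize for safety


def mmse_estimate(posterior, state_means):
    """MMSE estimate = posterior-weighted average of the state mean vectors."""
    dim = len(state_means[0])
    return [
        sum(p * mean[k] for p, mean in zip(posterior, state_means))
        for k in range(dim)
    ]


def smooth_with_previous(estimate, prev_isf, weight):
    """Weight the estimate against the previous frame's ISF vector.
    The weight value is a hypothetical parameter, not taken from the paper."""
    return [weight * e + (1.0 - weight) * p for e, p in zip(estimate, prev_isf)]


def conceal_lost_frame(prev_posterior, transition, state_means, prev_isf, weight=0.5):
    """Return a concealed ISF vector for one lost frame."""
    posterior = predict_state_posterior(prev_posterior, transition)
    estimate = mmse_estimate(posterior, state_means)
    return smooth_with_previous(estimate, prev_isf, weight)


# Toy example: 2 states, 2-dimensional "ISF" vectors in Hz (illustrative values only).
transition = [[0.75, 0.25], [0.5, 0.5]]
state_means = [[400.0, 800.0], [600.0, 1200.0]]
prev_posterior = [1.0, 0.0]   # the last good frame was decoded as state 0
prev_isf = [400.0, 800.0]     # ISF vector of the last good frame

concealed = conceal_lost_frame(prev_posterior, transition, state_means, prev_isf)
print(concealed)
```

Because both the MMSE estimate and the smoothing step are convex combinations, the concealed vector keeps the ascending ordering of its inputs whenever the state means and the previous frame's vector are themselves ordered.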