计算机工程与应用
計算機工程與應用
계산궤공정여응용
COMPUTER ENGINEERING AND APPLICATIONS
2014年
2期
204-207,230
,共5页
浊音隶属度%模糊聚类%矢量量化%F-LBG算法%声码器
濁音隸屬度%模糊聚類%矢量量化%F-LBG算法%聲碼器
탁음대속도%모호취류%시량양화%F-LBG산법%성마기
voiced membership%fuzzy clustering%Vector Quantization(VQ)%F-LBG algorithm%vocoder
为了克服低速率声码器因清浊音硬判决、粗判决而导致解码语音有帧过渡等不自然感的缺陷,在分析比较目前主流声码器编码算法中激励参数提取和量化算法的基础上,将模糊数学中的隶属度概念引入语音子带清浊音描述中,提出了5维的浊音隶属度矢量概念,用于精细描述语音丰富的激励信息;介绍了浊音隶属度矢量的提取算法;提出了矢量量化码本的模糊聚类与LBG级联训练算法(F-LBG);用提取算法提取、建立了浊音隶属度码本的训练样本集,用F-LBG训练了浊音隶属度码本;将提取算法和F-LBG法训练得到的浊音隶属度码本分别应用于正弦激励声码器、混合激励声码器和同态声码器进行语音编、解码仿真;结果表明,用浊音隶属度矢量描述和合成语音激励信号的算法,具有较高的准确性和较强的噪声鲁棒性。
為瞭剋服低速率聲碼器因清濁音硬判決、粗判決而導緻解碼語音有幀過渡等不自然感的缺陷,在分析比較目前主流聲碼器編碼算法中激勵參數提取和量化算法的基礎上,將模糊數學中的隸屬度概唸引入語音子帶清濁音描述中,提齣瞭5維的濁音隸屬度矢量概唸,用于精細描述語音豐富的激勵信息;介紹瞭濁音隸屬度矢量的提取算法;提齣瞭矢量量化碼本的模糊聚類與LBG級聯訓練算法(F-LBG);用提取算法提取、建立瞭濁音隸屬度碼本的訓練樣本集,用F-LBG訓練瞭濁音隸屬度碼本;將提取算法和F-LBG法訓練得到的濁音隸屬度碼本分彆應用于正絃激勵聲碼器、混閤激勵聲碼器和同態聲碼器進行語音編、解碼倣真;結果錶明,用濁音隸屬度矢量描述和閤成語音激勵信號的算法,具有較高的準確性和較彊的譟聲魯棒性。
위료극복저속솔성마기인청탁음경판결、조판결이도치해마어음유정과도등불자연감적결함,재분석비교목전주류성마기편마산법중격려삼수제취화양화산법적기출상,장모호수학중적대속도개념인입어음자대청탁음묘술중,제출료5유적탁음대속도시량개념,용우정세묘술어음봉부적격려신식;개소료탁음대속도시량적제취산법;제출료시량양화마본적모호취류여LBG급련훈련산법(F-LBG);용제취산법제취、건립료탁음대속도마본적훈련양본집,용F-LBG훈련료탁음대속도마본;장제취산법화F-LBG법훈련득도적탁음대속도마본분별응용우정현격려성마기、혼합격려성마기화동태성마기진행어음편、해마방진;결과표명,용탁음대속도시량묘술화합성어음격려신호적산법,구유교고적준학성화교강적조성로봉성。
In order to overcome the defect of unnaturalness caused by hard decisions of voiced/unvoiced speech segments in LPC based low bit rate vocoder, a 5-dimensional voiced membership vector for accurately describing the information of excitation is proposed on the basis of analyzing and comparing the algorithms which are used by today’s mainstream vocoders for extracting and quantifying parameters of the excitation signal. This paper introduces the concept of member-ship from fuzzy mathematics, describes the extracting algorithm of voiced membership vector and the VQ codebook of fuzzy clustering, and the LBG cascade training. Computer simulation experiments are conducted. The results indicate that this algorithm, using voiced membership vector to describe and synthesize the excitation signal, can achieve a very high accuracy and strong noise robustness when it is used to simulate the speech coding and decoding process for sinusoidal ex-citation vocoder, mixed-excitation vocoder and the homomorphic vocoder.