电子科技大学学报
電子科技大學學報
전자과기대학학보
JOURNAL OF UNIVERSITY OF ELECTRONIC SCIENCE AND TECHNOLOGY OF CHINA
2010年
2期
186-190
,共5页
信道估计%部分W-分离正交%主分量分析%语音分离
信道估計%部分W-分離正交%主分量分析%語音分離
신도고계%부분W-분리정교%주분량분석%어음분리
channel estimation%partial W-disjoint orthogonality%principal component analysis%speech separation
在W-分离正交性假设的语音盲分离方法中,由于没有考虑多个源信号同时存在的情况,导致分离信号中不可避免地存在音乐噪声.针对这种部分W-分离正交情况,提出了基于信道估计的语音盲分离方法.该方法先检测只有一个源信号存在的时频点并进行归一化处理,使得处理后的结果与频率无关,克服了W-分离正交性假设的不足以及频率置换问题,通过K-means聚类估计出信道,再结合信号子空间方法重构源信号.仿真结果表明,提出的方法可以有效减少分离语音中的音乐噪声,与典型的时频二元掩蔽方法相比,其平均信号失真比提高3.02 dB,同时平均信干比提高4.61 dB.
在W-分離正交性假設的語音盲分離方法中,由于沒有攷慮多箇源信號同時存在的情況,導緻分離信號中不可避免地存在音樂譟聲.針對這種部分W-分離正交情況,提齣瞭基于信道估計的語音盲分離方法.該方法先檢測隻有一箇源信號存在的時頻點併進行歸一化處理,使得處理後的結果與頻率無關,剋服瞭W-分離正交性假設的不足以及頻率置換問題,通過K-means聚類估計齣信道,再結閤信號子空間方法重構源信號.倣真結果錶明,提齣的方法可以有效減少分離語音中的音樂譟聲,與典型的時頻二元掩蔽方法相比,其平均信號失真比提高3.02 dB,同時平均信榦比提高4.61 dB.
재W-분리정교성가설적어음맹분리방법중,유우몰유고필다개원신호동시존재적정황,도치분리신호중불가피면지존재음악조성.침대저충부분W-분리정교정황,제출료기우신도고계적어음맹분리방법.해방법선검측지유일개원신호존재적시빈점병진행귀일화처리,사득처리후적결과여빈솔무관,극복료W-분리정교성가설적불족이급빈솔치환문제,통과K-means취류고계출신도,재결합신호자공간방법중구원신호.방진결과표명,제출적방법가이유효감소분리어음중적음악조성,여전형적시빈이원엄폐방법상비,기평균신호실진비제고3.02 dB,동시평균신간비제고4.61 dB.
In blind speech separation methods based on the assumption of W-disjoint orthogonality (W-DO), musical noise is inevitable in separated signals because the assumption does not include the case of existing multiple source signals in the time-frequency domain. A blind speech separation method based on channel estimation is proposed for partial approximate W-disjoint orthogonality. The time-frequency cells with only one source are detected and normalized to be independent of frequency, which overcomes not only the shortcoming of W-DO property but also the frequency permutation problem, and then the channel estimation is obtained by K-means clustering. Finally, signal subspace method is exploited to reconstruct sources. Simulation results demonstrate that the novel method can effectively reduce the musical noise in the separated speech signals, and it outperforms the typical time frequency binary masking method, the averaged signal to distortion ratio (SDR) is improved by 3.02 dB and the averaged signal to interference ratio (SIR) is improved by 4.61 dB.