CAJ | Academic paper

In blind speech separation methods that assume W-disjoint orthogonality (W-DO), musical noise is unavoidable in the separated signals because the assumption ignores time-frequency points at which several source signals are active simultaneously. To handle this partially W-disjoint orthogonal case, a blind speech separation method based on channel estimation is proposed. The method first detects the time-frequency points at which only one source is present and normalizes them so that the result is independent of frequency, which overcomes both the limitation of the W-DO assumption and the frequency permutation problem. The channel is then estimated by K-means clustering, and the source signals are reconstructed with a signal subspace method. Simulation results show that the proposed method effectively reduces musical noise in the separated speech: compared with a typical time-frequency binary masking method, the average signal-to-distortion ratio (SDR) improves by 3.02 dB and the average signal-to-interference ratio (SIR) improves by 4.61 dB.
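The normalization and clustering steps described above can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract does not give the detection rule, the normalization formula, or the mixing model, so everything below is an assumption. The sketch assumes a two-microphone anechoic model in which each source reaches the second microphone with a relative gain and delay, assumes the single-source time-frequency points have already been detected, and omits the signal subspace reconstruction. The names `normalize_cells`, `kmeans`, and `demo` are hypothetical.

```python
import numpy as np


def normalize_cells(x1, x2, omega):
    """Map single-source cells to frequency-independent (gain, delay) features.

    Assumed model (not from the paper): x2 = gain * exp(-1j*omega*delay) * x1.
    """
    ratio = x2 / x1
    gain = np.abs(ratio)
    # Dividing the phase by omega removes the frequency dependence.
    # Only valid while |omega * delay| < pi (no phase wrapping).
    delay = -np.angle(ratio) / omega
    return np.column_stack([gain, delay])


def kmeans(points, k, n_iter=50):
    """Plain Lloyd's K-means with a deterministic farthest-point initialization."""
    centers = [points[0]]
    for _ in range(1, k):
        # Squared distance of every point to its nearest chosen center.
        d = np.min([np.sum((points - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(points[np.argmax(d)])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        dist = np.sum((points[:, None, :] - centers[None, :, :]) ** 2, axis=2)
        labels = np.argmin(dist, axis=1)
        for j in range(k):
            if np.any(labels == j):  # keep the old center if a cluster is empty
                centers[j] = points[labels == j].mean(axis=0)
    return centers, labels


def demo(seed=0, n=400):
    """Synthetic single-source cells from two sources with assumed parameters."""
    rng = np.random.default_rng(seed)
    gains = np.array([0.6, 1.4])    # assumed relative gains
    delays = np.array([-1.0, 2.0])  # assumed relative delays, in samples
    src = rng.integers(0, 2, size=n)       # which source owns each cell
    omega = rng.uniform(0.1, 1.0, size=n)  # rad/sample, keeps |omega*delay| < pi
    s = rng.uniform(0.5, 1.5, size=n) * np.exp(1j * rng.uniform(-np.pi, np.pi, size=n))
    noise = 0.005 * (rng.standard_normal((2, n)) + 1j * rng.standard_normal((2, n)))
    x1 = s + noise[0]
    x2 = gains[src] * np.exp(-1j * omega * delays[src]) * s + noise[1]
    feats = normalize_cells(x1, x2, omega)
    centers, labels = kmeans(feats, k=2)
    order = np.argsort(centers[:, 0])  # sort clusters by estimated gain
    return centers[order], src, labels


if __name__ == "__main__":
    est, _, _ = demo()
    print(est)  # rows are estimated (gain, delay) pairs, one per source
```

Because the features no longer depend on frequency, cells from all frequency bins can be clustered together, so under these assumptions there are no per-bin cluster labels to align afterwards. Each cluster center then serves as the channel estimate for one source.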