Paper Title
Binaural Speech Enhancement Using STOI-Optimal Masks
Paper Authors
Abstract
STOI-optimal masking has previously been proposed and developed for single-channel speech enhancement. In this paper, we consider its extension to binaural speech enhancement, in which spatial information is known to be important to speech understanding and should therefore be preserved by the enhancement processing. A mask is estimated for each of the binaural channels individually, and a "better-ear listening" mask is computed by choosing the maximum of the two masks. The estimated mask is used to supply probability information about speech presence in each time-frequency bin to an Optimally-Modified Log-Spectral Amplitude (OM-LSA) enhancer. We show that applying the proposed method to binaural signals with a directional noise not only improves the SNR of the noisy signal but also preserves the binaural cues and intelligibility.
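The mask-combination step described in the abstract can be illustrated with a minimal sketch. It assumes the per-channel masks are already available as arrays of values in [0, 1] with shape (frequency bins, frames); the mask estimator itself is not shown. The function names `better_ear_mask` and `omlsa_style_gain`, the floor value `gain_min=0.1`, and the toy mask values are all hypothetical and chosen for illustration only. The gain rule follows the standard OM-LSA form, in which the speech-present gain is raised to the power of the speech presence probability and multiplied by a floor gain raised to its complement; this is not necessarily the exact configuration used in the paper.

```python
import numpy as np


def better_ear_mask(mask_left, mask_right):
    """Element-wise maximum of the two per-channel masks.

    Both inputs are assumed to be arrays of the same shape
    (frequency bins x frames) with values in [0, 1].
    """
    return np.maximum(mask_left, mask_right)


def omlsa_style_gain(p_speech, gain_h1, gain_min=0.1):
    """Combine a speech-present gain with a floor gain, OM-LSA style.

    G = gain_h1 ** p * gain_min ** (1 - p), where p is the speech
    presence probability per time-frequency bin. gain_min is an
    illustrative floor, not a value taken from the paper.
    """
    p = np.clip(p_speech, 0.0, 1.0)
    return gain_h1 ** p * gain_min ** (1.0 - p)


# Toy masks for the left and right channels (2 bins x 2 frames).
mask_l = np.array([[0.9, 0.2], [0.1, 0.6]])
mask_r = np.array([[0.4, 0.7], [0.1, 0.3]])

# The shared mask acts as the speech presence probability.
p = better_ear_mask(mask_l, mask_r)

# With a unit speech-present gain, the result is driven purely by p.
gain = omlsa_style_gain(p, gain_h1=np.ones_like(p))
```

In this sketch a single gain array is derived from the shared mask. Applying one real-valued gain to both channels would leave interaural level and phase differences unchanged in each time-frequency bin, which is one plausible way to read the abstract's claim about preserving binaural cues; whether the paper applies the gain in exactly this way is an assumption.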