Theoretical Analysis of Musical Noise Generation in Noise Reduction Methods with Decision-Directed a Priori SNR Estimator

Konferenz: IWAENC 2012 - International Workshop on Acoustic Signal Enhancement
04.09.2012-06.09.2012 in Aachen, Germany

Tagungsband: IWAENC 2012

Seiten: 4Sprache: EnglischTyp: PDF

Persönliche VDE-Mitglieder erhalten auf diesen Artikel 10% Rabatt

Kanehara, Suzumi; Saruwatari, Hiroshi; Miyazaki, Ryoichi; Shikano, Kiyohiro (Nara Institute of Science and Technology, Nara, 630-0192 Japan)
Kondo, Kazunobu (YAMAHA Corporate Research & Development Center, Shizuoka, 438-0192 Japan)

In this paper, we propose a new theoretical analysis of the amount of musical noise generated in several noise reduction methods with a decision-directed a priori SNR estimator using higher-order statistics. In our previous study, a musical noise assessment based on kurtosis has been successfully applied to spectral subtraction. However, this approach cannot be applied to some high-quality noise reduction methods, e.g., Ephraim-Malah’s minimum mean-square error short-time spectral amplitude (MMSE STSA) estimator, because such methods include the decision-directed a priori SNR estimator, which corresponds to a nonlinear recursive (infinite) process for noise power spectral sequences. Therefore, in this paper, we introduce a computationally efficient higher-order-moment calculation method based on generalized Gauss-Laguerre quadrature. We also mathematically clarify the justification of using a typical decision-directed parameter, namely, magic number 0.98, in the MMSE STSA estimator from a viewpoint of amounts of musical noise and speech distortion. Index Terms — Noise reduction, musical noise, higher-order statistics, generalized Gauss-Laguerre quadrature.