Optimization of Feature and Loss Exponents for Lightweight DNN-based Binaural Speech Enhancement

Conference: Speech Communication - 16th ITG Conference
09/24/2025 - 09/26/2025 in Berlin, Germany

Proceedings: ITG-Fb. 321: Speech Communication

Pages: 5
Language: English
Type: PDF

Authors:
Chinaev, Aleksej; Enzner, Gerald; Thaleiser, Stefan

Abstract:
Lightweight DNN-based binaural speech enhancement (BSE) requires careful system design to balance the trade-off between resource efficiency and performance. Using eight instrumental metrics for noise reduction, speech intelligibility, and audio quality, BSE components such as the feature compression and the loss function are optimized in a joint exploration of the power-law and mean p-power error exponents. The results provide guidance on BSE design for the implementation and comparison of systems. The study covers both the standard 16 kHz and the advanced 24 kHz sampling rates.
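
To make the two exponents in the abstract concrete, the following is a minimal sketch of power-law feature compression and a mean p-power error loss, assuming NumPy. The abstract does not give the exact formulation, so several details here are assumptions: that compression is applied to the spectral magnitude with the phase kept, the function names `power_law_compress` and `mean_p_power_error`, and the example values of `alpha` and `p`.

```python
import numpy as np


def power_law_compress(spec, alpha):
    """Compress spectral magnitudes as |X|**alpha and keep the phase.

    Assumption: magnitude-only compression; the paper may define it differently.
    """
    spec = np.asarray(spec, dtype=complex)
    magnitude = np.abs(spec)
    phase = np.angle(spec)
    return (magnitude ** alpha) * np.exp(1j * phase)


def mean_p_power_error(estimate, target, p):
    """Mean of |estimate - target|**p over all elements.

    p = 2 gives the mean squared error, p = 1 the mean absolute error.
    """
    error = np.abs(np.asarray(estimate) - np.asarray(target))
    return float(np.mean(error ** p))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical complex spectra standing in for clean and enhanced speech.
    clean = rng.standard_normal((4, 8)) + 1j * rng.standard_normal((4, 8))
    enhanced = clean + 0.1 * (rng.standard_normal((4, 8))
                              + 1j * rng.standard_normal((4, 8)))

    # Joint exploration over a small, illustrative grid of both exponents.
    for alpha in (0.3, 0.5, 1.0):
        for p in (1.0, 2.0):
            loss = mean_p_power_error(
                power_law_compress(enhanced, alpha),
                power_law_compress(clean, alpha),
                p,
            )
            print(f"alpha={alpha:.1f}  p={p:.1f}  loss={loss:.4f}")
```

The nested loop only illustrates what exploring both exponents jointly means. In the study itself, each exponent pair would be judged by the instrumental metrics of a trained system, not by the loss value alone.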