no code implementations • 4 Dec 2023 • Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler
This paper proposes SEFGAN, a Deep Neural Network (DNN) combining maximum likelihood training and Generative Adversarial Networks (GANs) for efficient speech enhancement (SE).
no code implementations • 30 May 2023 • Luca Resti, Martin Strauss, Matteo Torcoli, Emanuël Habets, Bernd Edler
When individual audio stems are unavailable from production, Dialogue Separation (DS) can be applied to the final audio mixture to obtain estimates of these stems.
no code implementations • 21 Oct 2022 • Martin Strauss, Matteo Torcoli, Bernd Edler
Deep generative models for Speech Enhancement (SE) have received increasing attention in recent years.
no code implementations • 16 Jun 2021 • Martin Strauss, Bernd Edler
Speech enhancement involves the distinction of a target speech signal from an intrusive background.
no code implementations • 16 Jun 2021 • Martin Strauss, Jouni Paulus, Matteo Torcoli, Bernd Edler
The music separation models are selected as they share the number of channels (2) and sampling rate (44.1 kHz or higher) with the considered broadcast content, and vocals separation in music is considered as a parallel for dialog separation in the target application domain.