Search Results for author: Martin Strauss

Found 5 papers, 0 papers with code

SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement

no code implementations4 Dec 2023 Martin Strauss, Nicola Pia, Nagashree K. S. Rao, Bernd Edler

This paper proposes SEFGAN, a Deep Neural Network (DNN) combining maximum likelihood training and Generative Adversarial Networks (GANs) for efficient speech enhancement (SE).

Audio Generation Speech Enhancement

Predicting Preferred Dialogue-to-Background Loudness Difference in Dialogue-Separated Audio

no code implementations30 May 2023 Luca Resti, Martin Strauss, Matteo Torcoli, Emanuël Habets, Bernd Edler

When individual audio stems are unavailable from production, Dialogue Separation (DS) can be applied to the final audio mixture to obtain estimates of these stems.

A Flow-Based Neural Network for Time Domain Speech Enhancement

no code implementations16 Jun 2021 Martin Strauss, Bernd Edler

Speech enhancement involves the distinction of a target speech signal from an intrusive background.

Density Estimation Speech Enhancement +1

A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation

no code implementations16 Jun 2021 Martin Strauss, Jouni Paulus, Matteo Torcoli, Bernd Edler

The music separation models are selected as they share the number of channels (2) and sampling rate (44. 1 kHz or higher) with the considered broadcast content, and vocals separation in music is considered as a parallel for dialog separation in the target application domain.

Music Source Separation Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.