no code implementations • 7 May 2024 • Eloi Moliner, Jean-Marie Lemercier, Simon Welker, Timo Gerkmann, Vesa Välimäki
In this paper, we present an unsupervised single-channel method for joint blind dereverberation and room impulse response estimation, based on posterior sampling with diffusion models.
no code implementations • 15 Feb 2024 • Jean-Marie Lemercier, Julius Richter, Simon Welker, Eloi Moliner, Vesa Välimäki, Timo Gerkmann
Here, we aim to show that diffusion models can combine the best of both worlds and offer the opportunity to design audio restoration algorithms with a good degree of interpretability and a remarkable performance in terms of sound quality.
no code implementations • 14 Sep 2023 • Navin Raj Prabhu, Bunlong Lay, Simon Welker, Nale Lehmann-Willenbrock, Timo Gerkmann
Subsequently, at inference, a target emotion embedding is employed to convert the emotion of the input utterance to the given target emotion.
no code implementations • 14 Sep 2023 • Simon Welker, Tal Peer, Henry N. Chapman, Timo Gerkmann
In this work, we demonstrate that the ptychographic phase problem can be solved in a live fashion during scanning, while data is still being collected.
no code implementations • 13 Sep 2023 • Tal Peer, Simon Welker, Johannes Kolhoff, Timo Gerkmann
Several recent contributions in the field of iterative STFT phase retrieval have demonstrated that the performance of the classical Griffin-Lim method can be considerably improved upon.
1 code implementation • 21 Jun 2023 • Jean-Marie Lemercier, Simon Welker, Timo Gerkmann
We present in this paper an informed single-channel dereverberation method based on conditional generation with diffusion models.
no code implementations • 15 Mar 2023 • Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Tal Peer, Timo Gerkmann
In this paper, we present a causal speech signal improvement system that is designed to handle different types of distortions.
2 code implementations • 28 Feb 2023 • Bunlong Lay, Simon Welker, Julius Richter, Timo Gerkmann
Recently, score-based generative models have been successfully employed for the task of speech enhancement.
2 code implementations • 22 Dec 2022 • Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann
As diffusion models are generative approaches they may also produce vocalizing and breathing artifacts in adverse conditions.
1 code implementation • 12 Nov 2022 • Simon Welker, Henry N. Chapman, Timo Gerkmann
In this work, we utilize the high-fidelity generation abilities of diffusion models to solve blind JPEG restoration at high compression levels.
no code implementations • 8 Nov 2022 • Tal Peer, Simon Welker, Timo Gerkmann
Diffusion probabilistic models have been recently used in a variety of tasks, including speech enhancement and synthesis.
1 code implementation • 4 Nov 2022 • Jean-Marie Lemercier, Julius Richter, Simon Welker, Timo Gerkmann
In this paper, we systematically compare the performance of generative diffusion models and discriminative approaches on different speech restoration tasks.
1 code implementation • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2023 • Julius Richter, Simon Welker, Jean-Marie Lemercier, Bunlong Lay, Timo Gerkmann
This matches our forward process which moves from clean speech to noisy speech by including a drift term.
Ranked #20 on Speech Enhancement on VoiceBank + DEMAND
no code implementations • 11 May 2022 • Tal Peer, Simon Welker, Timo Gerkmann
Phase retrieval is a problem encountered not only in speech and audio processing, but in many other fields such as optics.
1 code implementation • 31 Mar 2022 • Simon Welker, Julius Richter, Timo Gerkmann
Score-based generative models (SGMs) have recently shown impressive results for difficult generative tasks such as the unconditional and conditional generation of natural images and audio signals.
no code implementations • 17 Feb 2022 • Simon Welker, Tal Peer, Henry N. Chapman, Timo Gerkmann
One of the most prominent challenges in the field of diffractive imaging is the phase retrieval (PR) problem: In order to reconstruct an object from its diffraction pattern, the inverse Fourier transform must be computed.
no code implementations • 11 Feb 2021 • Simon Welker, Muhamed Amin, Jochen Küpper
CMInject simulates nanoparticle injection experiments of particles with diameters in the micrometer to nanometer-regime, e. g., for single-particle-imaging experiments.
Computational Physics Fluid Dynamics Instrumentation and Detectors