no code implementations • 8 Feb 2024 • Karim Helwani, Masahito Togami, Paris Smaragdis, Michael M. Goodwin
In this paper, we present a hybrid classical digital signal processing/deep neural network (DSP/DNN) approach to source separation (SS) highlighting the theoretical link between variational autoencoder and classical approaches to SS.
no code implementations • 1 Feb 2024 • Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin
The algorithm runs in real-time on 10-ms frames with a 40 ms of look-ahead.
no code implementations • 10 Oct 2023 • Karim Helwani, Erfan Soltanmohammadi, Michael M. Goodwin
Improving the interpretability of deep neural networks has recently gained increased attention, especially when the power of deep learning is leveraged to solve problems in physics.
no code implementations • 25 Sep 2023 • Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin
Speech codec enhancement methods are designed to remove distortions added by speech codecs.
no code implementations • 23 Feb 2023 • Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis
In this study, we present an approach to train a single speech enhancement network that can perform both personalized and non-personalized speech enhancement.
no code implementations • 28 Mar 2022 • Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy
Singing voice separation aims to separate music into vocals and accompaniment components.