Search Results for author: Michael M. Goodwin

Found 6 papers, 0 papers with code

Sound Source Separation Using Latent Variational Block-Wise Disentanglement

no code implementations • 8 Feb 2024 • Karim Helwani, Masahito Togami, Paris Smaragdis, Michael M. Goodwin

In this paper, we present a hybrid classical digital signal processing/deep neural network (DSP/DNN) approach to source separation (SS) highlighting the theoretical link between variational autoencoder and classical approaches to SS.

Disentanglement

Paper
Add Code

Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure

no code implementations • 1 Feb 2024 • Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin

The algorithm runs in real-time on 10-ms frames with a 40 ms of look-ahead.

Speech Enhancement

Paper
Add Code

Neural Harmonium: An Interpretable Deep Structure for Nonlinear Dynamic System Identification with Application to Audio Processing

no code implementations • 10 Oct 2023 • Karim Helwani, Erfan Soltanmohammadi, Michael M. Goodwin

Improving the interpretability of deep neural networks has recently gained increased attention, especially when the power of deep learning is leveraged to solve problems in physics.

Acoustic echo cancellation Audio Signal Processing

Paper
Add Code

NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping

no code implementations • 25 Sep 2023 • Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin

Speech codec enhancement methods are designed to remove distortions added by speech codecs.

Paper
Add Code

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

no code implementations • 23 Feb 2023 • Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis

In this study, we present an approach to train a single speech enhancement network that can perform both personalized and non-personalized speech enhancement.

Multi-Task Learning Speech Enhancement

Paper
Add Code

Improved singing voice separation with chromagram-based pitch-aware remixing

no code implementations • 28 Mar 2022 • Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy

Singing voice separation aims to separate music into vocals and accompaniment components.

Data Augmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.