Search Results for author: Ritwik Giri

Found 15 papers, 1 paper with code

Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure

no code implementations • 1 Feb 2024 • Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin

The algorithm runs in real time on 10-ms frames with 40 ms of look-ahead.

Speech Enhancement

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

no code implementations • 23 Feb 2023 • Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis

In this study, we present an approach to train a single speech enhancement network that can perform both personalized and non-personalized speech enhancement.

Multi-Task Learning, Speech Enhancement

Semi-supervised Time Domain Target Speaker Extraction with Attention

no code implementations • 18 Jun 2022 • Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Mike Goodwin, Arvindh Krishnaswamy

In this work, we propose Exformer, a time-domain architecture for target speaker extraction.

Target Speaker Extraction

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

no code implementations • 16 Jun 2022 • Jean-Marc Valin, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Arvindh Krishnaswamy

In real life, room effect, also known as room reverberation, and the present background noise degrade the quality of speech.

Denoising, Speech Enhancement

Improved singing voice separation with chromagram-based pitch-aware remixing

no code implementations • 28 Mar 2022 • Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy

Singing voice separation aims to separate music into vocals and accompaniment components.

Data Augmentation

Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement

no code implementations • 8 Jun 2021 • Ritwik Giri, Shrikant Venkataramani, Jean-Marc Valin, Umut Isik, Arvindh Krishnaswamy

The presence of multiple talkers in the surrounding environment poses a difficult challenge for real-time speech communication systems considering the constraints on network size and complexity.

Semi-Supervised Singing Voice Separation with Noisy Self-Training

no code implementations • 16 Feb 2021 • Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy

Given a limited set of labeled data, we present a method to leverage a large volume of unlabeled data to improve the model's performance.

Data Augmentation

Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders

no code implementations • 12 Feb 2021 • Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy

Audio codecs based on discretized neural autoencoders have recently been developed and shown to provide significantly higher compression levels for comparable quality speech output.

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

no code implementations • 11 Aug 2020 • Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy

Neural network applications generally benefit from larger-sized models, but for current speech enhancement models, larger scale networks often suffer from decreased robustness to the variety of real-world use cases beyond what is encountered in training data.

Ranked #8 on Speech Enhancement on Deep Noise Suppression (DNS) Challenge

Speech Enhancement

Channel-Attention Dense U-Net for Multichannel Speech Enhancement

1 code implementation • 30 Jan 2020 • Bahareh Tolooshams, Ritwik Giri, Andrew H. Song, Umut Isik, Arvindh Krishnaswamy

Supervised deep learning has gained significant attention for speech enhancement recently.

Ranked #2 on Speech Enhancement on CHiME-3

Speech Enhancement

From Speech-to-Speech Translation to Automatic Dubbing

no code implementations • WS 2020 • Marcello Federico, Robert Enyedi, Roberto Barra-Chicote, Ritwik Giri, Umut Isik, Arvindh Krishnaswamy, Hassan Sawaf

We present enhancements to a speech-to-speech translation pipeline in order to perform automatic dubbing.

Machine Translation, Speech-to-Speech Translation +1

Relevance Subject Machine: A Novel Person Re-identification Framework

no code implementations • 30 Mar 2017 • Igor Fedorov, Ritwik Giri, Bhaskar D. Rao, Truong Q. Nguyen

We propose a novel method called the Relevance Subject Machine (RSM) to solve the person re-identification (re-id) problem.

Person Re-Identification

Robust Bayesian Method for Simultaneous Block Sparse Signal Recovery with Applications to Face Recognition

no code implementations • 6 May 2016 • Igor Fedorov, Ritwik Giri, Bhaskar D. Rao, Truong Q. Nguyen

In this paper, we present a novel Bayesian approach to recover simultaneously block sparse signals in the presence of outliers.

Face Recognition

A Unified Framework for Sparse Non-Negative Least Squares using Multiplicative Updates and the Non-Negative Matrix Factorization Problem

no code implementations • 7 Apr 2016 • Igor Fedorov, Alican Nalci, Ritwik Giri, Bhaskar D. Rao, Truong Q. Nguyen, Harinath Garudadri

We show that the proposed framework encompasses a large class of S-NNLS algorithms and provide a computationally efficient inference procedure based on multiplicative update rules.

Type I and Type II Bayesian Methods for Sparse Signal Recovery using Scale Mixtures

no code implementations • 17 Jul 2015 • Ritwik Giri, Bhaskar D. Rao

In this paper, we propose a generalized scale mixture family of distributions, namely the Power Exponential Scale Mixture (PESM) family, to model the sparsity inducing priors currently in use for sparse signal recovery (SSR).

Vocal Bursts Type Prediction
