Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Speech Enhancement	VoiceBank + DEMAND	SEMamba (+PCS)	PESQ	3.69	#1
Speech Enhancement	VoiceBank + DEMAND	SEMamba (+PCS)	CSIG	4.79	#2
Speech Enhancement	VoiceBank + DEMAND	SEMamba (+PCS)	CBAK	3.63	#5
Speech Enhancement	VoiceBank + DEMAND	SEMamba (+PCS)	COVL	4.37	#1
Speech Enhancement	VoiceBank + DEMAND	SEMamba (+PCS)	STOI	0.96	#8
Speech Enhancement	VoiceBank + DEMAND	SEMamba (+PCS)	Params (M)	2.25	#6

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-investigation-of-incorporating-mamba-for/speech-enhancement-on-demand)](https://paperswithcode.com/sota/speech-enhancement-on-demand?p=an-investigation-of-incorporating-mamba-for)`

An Investigation of Incorporating Mamba for Speech Enhancement

10 May 2024 · Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao

This work aims to study a scalable state-space model (SSM), Mamba, for the speech enhancement (SE) task. We exploit a Mamba-based regression model to characterize speech signals and build an SE system upon Mamba, termed SEMamba. We explore the properties of Mamba by integrating it as the core model in both basic and advanced SE systems, along with utilizing signal-level distances as well as metric-oriented loss functions. SEMamba demonstrates promising results and attains a PESQ score of 3.55 on the VoiceBank-DEMAND dataset. When combined with the perceptual contrast stretching technique, the proposed SEMamba yields a new state-of-the-art PESQ score of 3.69.
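
For readers unfamiliar with state-space models, the sketch below may help make the abstract's "SSM" concrete. It runs a generic diagonal linear SSM over a one-dimensional sequence using a zero-order-hold discretization. It is not the SEMamba architecture and not Mamba's selective mechanism: Mamba additionally makes the step size and the input and output projections depend on the input, which is omitted here. The function name `ssm_scan` and all parameter names are hypothetical, chosen only for illustration.

```python
import math


def ssm_scan(x, a, b, c, dt):
    """Run a diagonal linear state-space model over a 1-D sequence.

    Continuous-time form, per state channel i:
        h_i'(t) = a[i] * h_i(t) + b[i] * x(t)
        y(t)    = sum_i c[i] * h_i(t)
    Each a[i] must be non-zero (negative values give a stable system).
    """
    n = len(a)
    # Zero-order-hold discretization for a diagonal state matrix.
    a_bar = [math.exp(dt * a[i]) for i in range(n)]
    b_bar = [(a_bar[i] - 1.0) / a[i] * b[i] for i in range(n)]

    h = [0.0] * n  # hidden state starts at zero
    y = []
    for x_t in x:
        # Recurrence: h_t = a_bar * h_{t-1} + b_bar * x_t
        h = [a_bar[i] * h[i] + b_bar[i] * x_t for i in range(n)]
        # Readout: y_t = c . h_t
        y.append(sum(c[i] * h[i] for i in range(n)))
    return y


# Illustrative values only: one state channel driven by a constant input.
outputs = ssm_scan([1.0] * 200, a=[-1.0], b=[1.0], c=[1.0], dt=0.1)
```

With these illustrative values the output rises smoothly toward 1.0, the steady-state gain `-b*c/a` of the continuous system. The recurrence costs linear time in the sequence length, which is the property that makes this model family attractive for long inputs such as audio.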

Code

roychao19477/semamba official

Tasks

Speech Enhancement

Datasets

VoiceBank + DEMAND

Results from the Paper

Ranked #1 on Speech Enhancement on VoiceBank + DEMAND

Methods

Mamba
