Search Results for author: Shifeng Pan

Found 4 papers, 2 papers with code

InferGrad: Improving Diffusion Models for Vocoder by Considering Inference in Training

no code implementations • 8 Feb 2022 • Zehua Chen, Xu Tan, Ke Wang, Shifeng Pan, Danilo Mandic, Lei He, Sheng Zhao

In this paper, we propose InferGrad, a diffusion model for vocoder that incorporates inference process into training, to reduce the inference iterations while maintaining high generation quality.

Denoising

Paper
Add Code

Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis

no code implementations • 27 Jul 2021 • Shifeng Pan, Lei He

Secondly, in these models the content/text, prosody, and speaker timbre are usually highly entangled, it's therefore not realistic to expect a satisfied result when freely combining these components, such as to transfer speaking style between speakers.

Expressive Speech Synthesis Style Transfer

Paper
Add Code

Forward-Backward Decoding for Regularizing End-to-End TTS

1 code implementation • 18 Jul 2019 • Yibin Zheng, Xi Wang, Lei He, Shifeng Pan, Frank K. Soong, Zhengqi Wen, Jian-Hua Tao

Experimental results show our proposed methods especially the second one (bidirectional decoder regularization), leads a significantly improvement on both robustness and overall naturalness, as outperforming baseline (the revised version of Tacotron2) with a MOS gap of 0. 14 in a challenging test, and achieving close to human quality (4. 42 vs. 4. 49 in MOS) on general test.

Decoder

30,039

Paper
Code

Learning latent representations for style control and transfer in end-to-end speech synthesis

2 code implementations • 11 Dec 2018 • Ya-Jie Zhang, Shifeng Pan, Lei He, Zhen-Hua Ling

In this paper, we introduce the Variational Autoencoder (VAE) to an end-to-end speech synthesis model, to learn the latent representation of speaking styles in an unsupervised manner.

Speech Synthesis Style Transfer

166

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.