Multi-band MelGAN: Faster Waveform Generation for High-Quality Text-to-Speech

Interspeech2020 2020 Geng Yang Shan Yang Kai Liu Peng Fang Wei Chen Lei Xie

In this paper, we propose multi-band MelGAN, a much faster waveform generation model targeting to high-quality text-to-speech. Specifically, we improve the original MelGAN by the following aspects... (read more)

PDF Abstract

Categories


  • SOUND
  • AUDIO AND SPEECH PROCESSING