no code implementations • 27 Mar 2023 • Yi-Ting Lee, Da-Yi Wu, Chih-Chun Yang, Shou-De Lin
The goal of this paper is to report certain scientific discoveries about a Seq2Seq model.
no code implementations • 29 Jul 2022 • Da-Rong Liu, Po-chun Hsu, Yi-Chen Chen, Sung-Feng Huang, Shun-Po Chuang, Da-Yi Wu, Hung-Yi Lee
GAN training is adopted in the first stage to find the mapping relationship between unpaired speech and phone sequence.
1 code implementation • 31 Oct 2020 • Yen-Hao Chen, Da-Yi Wu, Tsung-Han Wu, Hung-Yi Lee
With a proper activation as an information bottleneck on content embeddings, the trade-off between the synthesis quality and the speaker similarity of the converted speech is improved drastically.
Audio and Speech Processing Sound
1 code implementation • 7 Jun 2020 • Da-Yi Wu, Yen-Hao Chen, Hung-Yi Lee
Voice conversion (VC) is a task that transforms the source speaker's timbre, accent, and tones in audio into another one's while preserving the linguistic content.
no code implementations • 28 May 2020 • Da-Yi Wu, Yi-Hsuan Yang
Specifically, given a speech input, and optionally the F0 contour of the target singing, the proposed model generates as the output a singing signal with a progressive-growing encoder/decoder architecture and boundary equilibrium GAN loss functions.