no code implementations • CVPR 2022 • Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim
In this work, we propose a joint system that combines talking face generation with text-to-speech and can generate multilingual talking face videos from text input alone.
1 code implementation • 20 Apr 2022 • Ki-Ung Song, Dongseok Shim, Kang-wook Kim, Jae-young Lee, Younggeun Kim
Super-resolution is inherently ill-posed: a single low-resolution (LR) image can correspond to multiple high-resolution (HR) images.
1 code implementation • 25 Oct 2021 • Kang-wook Kim, Junhyeok Lee
We propose a singing decomposition system that encodes time-aligned linguistic content, pitch, and source speaker identity via Assem-VC.
1 code implementation • 2 Apr 2021 • Kang-wook Kim, Seung-won Park, Junhyeok Lee, Myun-chul Joe
Recent works on voice conversion (VC) focus on preserving the rhythm and the intonation as well as the linguistic content.