no code implementations • 18 Apr 2024 • Thibault Castells, Hyoung-Kyu Song, Bo-Kyeong Kim, Shinkook Choi
Latent Diffusion Models (LDMs) have emerged as powerful generative models, known for delivering remarkable results under constrained computational resources.
no code implementations • 18 Apr 2024 • Thibault Castells, Hyoung-Kyu Song, Tairen Piao, Shinkook Choi, Bo-Kyeong Kim, Hanyoung Yim, Changgwun Lee, Jae Gon Kim, Tae-Ho Kim
The intensive computational burden of Stable Diffusion (SD) for text-to-image generation poses a significant hurdle for its practical application.
no code implementations • 28 Feb 2024 • Changho Choi, Minho Kim, Junhyeok Lee, Hyoung-Kyu Song, Younggeun Kim, Seungryong Kim
We show that our framework is applicable to other generators such as StyleNeRF, paving a way to 3D-aware face swapping and is also compatible with other downstream StyleGAN2 generator tasks.
no code implementations • 5 Feb 2024 • Bo-Kyeong Kim, Geonmin Kim, Tae-Ho Kim, Thibault Castells, Shinkook Choi, Junho Shin, Hyoung-Kyu Song
Structured pruning of modern large language models (LLMs) has emerged as a way of decreasing their high computational needs.
3 code implementations • 25 May 2023 • Bo-Kyeong Kim, Hyoung-Kyu Song, Thibault Castells, Shinkook Choi
Text-to-image (T2I) generation with Stable Diffusion models (SDMs) involves high computing demands due to billion-scale parameters.
DreamBooth Personalized Generation Image-to-Image Translation
no code implementations • 2 Apr 2023 • Bo-Kyeong Kim, Jaemin Kang, Daeun Seo, Hancheol Park, Shinkook Choi, Hyoung-Kyu Song, Hyungshin Kim, Sungsu Lim
Virtual humans have gained considerable attention in numerous industries, e. g., entertainment and e-commerce.
no code implementations • CVPR 2022 • Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee, Seungmin Yang, Hyunjae Cho, Youseong Lee, Dongho Choi, Kang-wook Kim
In this work, we propose a joint system combining a talking face generation system with a text-to-speech system that can generate multilingual talking face videos from only the text input.
no code implementations • 3 Sep 2019 • Hyoung-Kyu Song, Ebrahim AlAlkeem, Jaewoong Yun, Tae-Ho Kim, Hyerin Yoo, Dasom Heo, Chan Yeob Yeun, Myungsu Chae
Most research has only focused on single modality or a single task, while the combination of input modality or tasks is yet to be investigated.