no code implementations • 26 Jan 2023 • Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, In So Kweon
We present a novel data-efficient semi-supervised framework to improve the generalization of image captioning models.
no code implementations • 10 Nov 2021 • Jinsoo Choi, Jaesik Park, In So Kweon
Videos are a popular media form, where online video streaming has recently gathered much popularity.
no code implementations • 21 Oct 2021 • Dong-Jin Kim, Jae Won Cho, Jinsoo Choi, Yunjae Jung, In So Kweon
In this work, we address Active Learning in the multi-modal setting of Visual Question Answering (VQA).
1 code implementation • 9 Sep 2021 • Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon
A common problem in the task of human-object interaction (HOI) detection is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.
Ranked #41 on Human-Object Interaction Detection on HICO-DET
no code implementations • 13 Apr 2021 • Jae Won Cho, Dong-Jin Kim, Jinsoo Choi, Yunjae Jung, In So Kweon
In this work, we address the issues of missing modalities that have arisen from the Visual Question Answer-Difference prediction task and find a novel method to solve the task at hand.
1 code implementation • 8 Oct 2020 • Dong-Jin Kim, Tae-Hyun Oh, Jinsoo Choi, In So Kweon
To this end, we propose the multi-task triple-stream network (MTTSNet) which consists of three recurrent units responsible for each POS which is trained by jointly predicting the correct captions and POS for each word.
1 code implementation • 17 Jul 2020 • Dong-Jin Kim, Xiao Sun, Jinsoo Choi, Stephen Lin, In So Kweon
A common problem in human-object interaction (HOI) detection task is that numerous HOI classes have only a small number of labeled examples, resulting in training sets with a long-tailed distribution.
1 code implementation • 5 Sep 2019 • Jinsoo Choi, In So Kweon
We present a novel deep approach to video stabilization which can generate video frames without cropping and low distortion.
no code implementations • IJCNLP 2019 • Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, In So Kweon
To this end, our proposed semi-supervised learning method assigns pseudo-labels to unpaired samples via Generative Adversarial Networks to learn the joint distribution of image and caption.
1 code implementation • CVPR 2019 • Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, In So Kweon
Our goal in this work is to train an image captioning model that generates more dense and informative captions.
no code implementations • 14 Feb 2018 • Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh, Youngjin Yoon, In So Kweon
Human behavior understanding is arguably one of the most important mid-level components in artificial intelligence.
no code implementations • 6 Feb 2017 • Jinsoo Choi, Tae-Hyun Oh, In So Kweon
Despite the challenging baselines, our method still manages to show comparable or even exceeding performance.
no code implementations • CVPR 2016 • Jinsoo Choi, Tae-Hyun Oh, In So Kweon
Inspired by plot analysis of written stories, our method generates a sequence of video clips ordered in such a way that it reflects plot dynamics and content coherency.
no code implementations • 12 Jan 2016 • Jinsoo Choi, Tae-Hyun Oh, In So Kweon
Photo collections and its applications today attempt to reflect user interactions in various forms.