no code implementations • 2 Dec 2020 • Jing Su, Chenghua Lin, Mian Zhou, Qingyun Dai, Haoyu Lv
In this paper, we propose an end-to-end CNN-LSTM model for generating descriptions for sequential images with a local-object attention mechanism.
no code implementations • WS 2018 • Jing Su, Chenghua Lin, Mian Zhou, Qingyun Dai, Haoyu Lv
Image Captioning Text Generation