no code implementations • 17 Feb 2024 • Jiwon Yoo, JangWon Lee, Gyeonghwan Kim
Multi-scale architecture, including hierarchical vision transformer, has been commonly applied to high-resolution semantic segmentation to deal with computational complexity with minimum performance loss.
no code implementations • 20 May 2017 • Chenyou Fan, JangWon Lee, Michael S. Ryoo
The key idea is that (1) an intermediate representation of a convolutional object recognition model abstracts scene information in its frame and that (2) we can predict (i. e., regress) such representations corresponding to the future frames based on that of the current frame.