1 code implementation • ICLR 2022 • Byungseok Roh, Jaewoong Shin, Wuhyun Shin, Saehoon Kim
Deformable DETR uses the multiscale feature to ameliorate performance, however, the number of encoder tokens increases by 20x compared to DETR, and the computation cost of the encoder attention remains a bottleneck.
2 code implementations • CVPR 2021 • Byungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim
While these contrastive methods mainly focus on generating invariant global representations at the image-level under semantic-preserving transformations, they are prone to overlook spatial consistency of local representations and therefore have a limitation in pretraining for localization tasks such as object detection and instance segmentation.