1 code implementation • ICCV 2023 • Liangqi Li, Jiaxu Miao, Dahu Shi, Wenming Tan, Ye Ren, Yi Yang, ShiLiang Pu
Current methods for open-vocabulary object detection (OVOD) rely on a pre-trained vision-language model (VLM) to acquire the recognition ability.
1 code implementation • ICCV 2023 • Heng Zhao, Shenxing Wei, Dahu Shi, Wenming Tan, Zheyang Li, Ye Ren, Xing Wei, Yi Yang, ShiLiang Pu
Taking the symmetry properties of objects into consideration, we design a symmetry-aware matching loss to facilitate the learning of dense point-wise geometry features and improve the performance considerably.
1 code implementation • CVPR 2023 2023 • Xing Wei, Yifan Bai, Yongchao Zheng, Dahu Shi, Yihong Gong
We present ARTrack, an autoregressive framework for visual object tracking.
Ranked #1 on Visual Tracking on TNL2K
1 code implementation • CVPR 2022 • Dahu Shi, Xing Wei, Liangqi Li, Ye Ren, Wenming Tan
Current methods of multi-person pose estimation typically treat the localization and association of body joints separately.
no code implementations • 31 Dec 2021 • Xing Wei, Yuanrui Kang, Jihao Yang, Yunfeng Qiu, Dahu Shi, Wenming Tan, Yihong Gong
First of all, we design a deformable attention in-built Transformer backbone, which learns adaptive feature representations with deformable sampling locations and dynamic attention weights.
1 code implementation • 21 Dec 2021 • Xiaodong Yu, Dahu Shi, Xing Wei, Ye Ren, Tingqun Ye, Wenming Tan
The pixel-wise mask, especially, is embedded by a group of parameters to construct a lightweight instance-aware transformer.
1 code implementation • 19 Jul 2021 • Dahu Shi, Xing Wei, Xiaodong Yu, Wenming Tan, Ye Ren, ShiLiang Pu
Multi-person pose estimation is an attractive and challenging task.
Ranked #4 on Multi-Person Pose Estimation on COCO minival