Search Results for author: Dahu Shi

Found 7 papers, 6 papers with code

Distilling DETR with Visual-Linguistic Knowledge for Open-Vocabulary Object Detection

1 code implementation • ICCV 2023 • Liangqi Li, Jiaxu Miao, Dahu Shi, Wenming Tan, Ye Ren, Yi Yang, ShiLiang Pu

Current methods for open-vocabulary object detection (OVOD) rely on a pre-trained vision-language model (VLM) to acquire the recognition ability.

Knowledge Distillation Language Modelling +2

136

Paper
Code

Learning Symmetry-Aware Geometry Correspondences for 6D Object Pose Estimation

1 code implementation • ICCV 2023 • Heng Zhao, Shenxing Wei, Dahu Shi, Wenming Tan, Zheyang Li, Ye Ren, Xing Wei, Yi Yang, ShiLiang Pu

Taking the symmetry properties of objects into consideration, we design a symmetry-aware matching loss to facilitate the learning of dense point-wise geometry features and improve the performance considerably.

6D Pose Estimation 6D Pose Estimation using RGB +3

Paper
Code

Autoregressive Visual Tracking

1 code implementation • CVPR 2023 2023 • Xing Wei, Yifan Bai, Yongchao Zheng, Dahu Shi, Yihong Gong

We present ARTrack, an autoregressive framework for visual object tracking.

Ranked #1 on Visual Tracking on TNL2K

Object Template Matching +2

191

Paper
Code

End-to-End Multi-Person Pose Estimation With Transformers

1 code implementation • CVPR 2022 • Dahu Shi, Xing Wei, Liangqi Li, Ye Ren, Wenming Tan

Current methods of multi-person pose estimation typically treat the localization and association of body joints separately.

Decoder Multi-Person Pose Estimation

136

Paper
Code

Scene-Adaptive Attention Network for Crowd Counting

no code implementations • 31 Dec 2021 • Xing Wei, Yuanrui Kang, Jihao Yang, Yunfeng Qiu, Dahu Shi, Wenming Tan, Yihong Gong

First of all, we design a deformable attention in-built Transformer backbone, which learns adaptive feature representations with deformable sampling locations and dynamic attention weights.

Crowd Counting