Search Results for author: Yunpeng Zhang

Found 13 papers, 10 papers with code

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

2 code implementations • 10 Apr 2024 • Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen

Finally, for MC3D-Det joint training, an elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.

3D Object Detection • Autonomous Driving • +1

211 stars

GraphAD: Interaction Scene Graph for End-to-end Autonomous Driving

1 code implementation • 28 Mar 2024 • Yunpeng Zhang, Deheng Qian, Ding Li, Yifeng Pan, Yong Chen, Zhenbao Liang, Zhiyao Zhang, Shurui Zhang, Hongxu Li, Maolei Fu, Yun Ye, Zhujin Liang, Yi Shan, Dalong Du

With the representation of the ISG, the driving agents aggregate essential information from the most influential elements, including the road agents with potential collisions and the map elements to follow.

Autonomous Driving

Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing

1 code implementation • 17 Oct 2023 • Hao Lu, Yunpeng Zhang, Qing Lian, Dalong Du, Yingcong Chen

In our approach, we render diverse view maps from BEV features and rectify the perspective bias of these maps, leveraging implicit foreground volumes to bridge the camera and BEV planes.

3D Object Detection • Domain Generalization • +2

141 stars

OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction

1 code implementation • ICCV 2023 • Yunpeng Zhang, Zheng Zhu, Dalong Du

Vision-based perception for autonomous driving has shifted from bird's-eye-view (BEV) representations to 3D semantic occupancy.

Ranked #3 on 3D Semantic Scene Completion from a single RGB image on SemanticKITTI

3D Semantic Occupancy Prediction • 3D Semantic Scene Completion from a single RGB image • +4

290 stars

A Simple Baseline for Supervised Surround-view Depth Estimation

no code implementations • 14 Mar 2023 • Xianda Guo, Wenjie Yuan, Yunpeng Zhang, Tian Yang, Chenming Zhang, Zheng Zhu, Long Chen

The former is achieved by the self-attention module within each view, while the latter is realized by the adjacent attention module, which computes the attention across multi-cameras to exchange the multi-scale representations across surround-view feature maps.

Autonomous Driving • Monocular Depth Estimation

OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception

1 code implementation • ICCV 2023 • XiaoFeng Wang, Zheng Zhu, Wenbo Xu, Yunpeng Zhang, Yi Wei, Xu Chi, Yun Ye, Dalong Du, Jiwen Lu, Xingang Wang

Towards a comprehensive benchmarking of surrounding perception algorithms, we propose OpenOccupancy, which is the first surrounding semantic occupancy perception benchmark.

Autonomous Driving • Benchmarking

526 stars

Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction

2 code implementations • CVPR 2023 • Yuanhui Huang, Wenzhao Zheng, Yunpeng Zhang, Jie Zhou, Jiwen Lu

To lift image features to the 3D TPV space, we further propose a transformer-based TPV encoder (TPVFormer) to obtain the TPV features effectively.

Ranked #1 on Prediction Of Occupancy Grid Maps on nuScenes

3D Semantic Scene Completion • Autonomous Driving • +1

4,858 stars

Are We Ready for Vision-Centric Driving Streaming Perception? The ASAP Benchmark

1 code implementation • CVPR 2023 • XiaoFeng Wang, Zheng Zhu, Yunpeng Zhang, Guan Huang, Yun Ye, Wenbo Xu, Ziwei Chen, Xingang Wang

To mitigate the problem, we propose the Autonomous-driving StreAming Perception (ASAP) benchmark, which is the first benchmark to evaluate the online performance of vision-centric perception in autonomous driving.

Depth Estimation • Motion Forecasting

A Simple Baseline for Multi-Camera 3D Object Detection

1 code implementation • 22 Aug 2022 • Yunpeng Zhang, Wenzhao Zheng, Zheng Zhu, Guan Huang, Jie Zhou, Jiwen Lu

First, we extract multi-scale features and generate the perspective object proposals on each monocular image.

Autonomous Driving • Monocular 3D Object Detection • +2

Motion Gait: Gait Recognition via Motion Excitation

no code implementations • 22 Jun 2022 • Yunpeng Zhang, Zhengyou Wang, Shanna Zhuang, Hui Wang

Gait recognition, which can realize long-distance and contactless identification, is an important biometric technology.

Gait Recognition

BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving

1 code implementation • 19 May 2022 • Yunpeng Zhang, Zheng Zhu, Wenzhao Zheng, JunJie Huang, Guan Huang, Jie Zhou, Jiwen Lu

Specifically, BEVerse first performs shared feature extraction and lifting to generate 4D BEV representations from multi-timestamp and multi-view images.

Ranked #15 on Robust Camera Only 3D Object Detection on nuScenes-C