Search Results for author: Xinli Xu

Found 6 papers, 4 papers with code

DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving

no code implementations7 May 2024 Chen Min, Dawei Zhao, Liang Xiao, Jian Zhao, Xinli Xu, Zheng Zhu, Lei Jin, Jianshu Li, Yulan Guo, Junliang Xing, Liping Jing, Yiming Nie, Bin Dai

In this paper, we address this challenge by introducing a world model-based autonomous driving 4D representation learning framework, dubbed \emph{DriveWorld}, which is capable of pre-training from multi-camera driving videos in a spatio-temporal fashion.

3D Object Detection Motion Forecasting +4

Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting

2 code implementations10 Apr 2024 Hao Lu, Jiaqi Tang, Xinli Xu, Xu Cao, Yunpeng Zhang, Guoqing Wang, Dalong Du, Hao Chen, Yingcong Chen

Finally, for MC3D-Det joint training, the elaborate dataset merge strategy is designed to solve the problem of inconsistent camera numbers and camera parameters.

3D Object Detection Autonomous Driving +1

TiG-BEV: Multi-view BEV 3D Object Detection via Target Inner-Geometry Learning

1 code implementation28 Dec 2022 Peixiang Huang, Li Liu, Renrui Zhang, Song Zhang, Xinli Xu, Baichao Wang, Guoyi Liu

In this paper, we propose the learning scheme of Target Inner-Geometry from the LiDAR modality into camera-based BEV detectors for both dense depth and BEV features, termed as TiG-BEV.

3D Object Detection object-detection

FusionRCNN: LiDAR-Camera Fusion for Two-stage 3D Object Detection

no code implementations22 Sep 2022 Xinli Xu, Shaocong Dong, Lihe Ding, Jie Wang, Tingfa Xu, Jianan Li

Existing 3D detectors significantly improve the accuracy by adopting a two-stage paradigm which merely relies on LiDAR point clouds for 3D proposal refinement.

3D Object Detection Autonomous Driving +2

Occupancy-MAE: Self-supervised Pre-training Large-scale LiDAR Point Clouds with Masked Occupancy Autoencoders

2 code implementations20 Jun 2022 Chen Min, Xinli Xu, Dawei Zhao, Liang Xiao, Yiming Nie, Bin Dai

This work proposes a solution to reduce the dependence on labelled 3D training data by leveraging pre-training on large-scale unlabeled outdoor LiDAR point clouds using masked autoencoders (MAE).

3D Object Detection 3D Semantic Segmentation +6

Cannot find the paper you are looking for? You can Submit a new open access paper.