TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular 3D Object Detection	KITTI Cars Easy	CaDDN	AP Easy	19.17	# 4
Monocular 3D Object Detection	KITTI Cars Hard	CaDDN	AP Hard	11.46	# 4
Monocular 3D Object Detection	KITTI Cars Moderate	CaDDN	AP Medium	13.41	# 11
Monocular 3D Object Detection	KITTI Cyclist Easy	CaDDN	AP Easy	7.00	# 2
Monocular 3D Object Detection	KITTI Cyclist Hard	CaDDN	AP Hard	3.30	# 2
Monocular 3D Object Detection	KITTI Cyclist Moderate	CaDDN	AP Medium	3.41	# 2
Monocular 3D Object Detection	KITTI Pedestrian Easy	CaDDN	AP Easy	12.87	# 3
Monocular 3D Object Detection	KITTI Pedestrian Hard	CaDDN	AP Hard	6.76	# 3
Monocular 3D Object Detection	KITTI Pedestrian Moderate	CaDDN	AP Medium	8.14	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-5)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-5?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-7)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-7?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-6)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-6?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-3)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-3?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-1)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-1?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-4)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-4?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-cars-2)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars-2?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-cars-1)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars-1?p=categorical-depth-distribution-network-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/categorical-depth-distribution-network-for/monocular-3d-object-detection-on-kitti-cars)](https://paperswithcode.com/sota/monocular-3d-object-detection-on-kitti-cars?p=categorical-depth-distribution-network-for)`

Categorical Depth Distribution Network for Monocular 3D Object Detection

CVPR 2021 · Cody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander ·

Monocular 3D object detection is a key problem for autonomous vehicles, as it provides a solution with simple configuration compared to typical multi-sensor systems. The main challenge in monocular 3D detection lies in accurately predicting object depth, which must be inferred from object and scene cues due to the lack of direct range measurement. Many methods attempt to directly estimate depth to assist in 3D detection, but show limited performance as a result of depth inaccuracy. Our proposed solution, Categorical Depth Distribution Network (CaDDN), uses a predicted categorical depth distribution for each pixel to project rich contextual feature information to the appropriate depth interval in 3D space. We then use the computationally efficient bird's-eye-view projection and single-stage detector to produce the final output bounding boxes. We design CaDDN as a fully differentiable end-to-end approach for joint depth estimation and object detection. We validate our approach on the KITTI 3D object detection benchmark, where we rank 1st among published monocular methods. We also provide the first monocular 3D detection results on the newly released Waymo Open Dataset. We provide a code release for CaDDN which is made available.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Code

Add Remove Mark official

TRAILab/CaDDN official

354

PaddlePaddle/Paddle3D

533

Tasks

Add Remove

3D Object Detection

Autonomous Vehicles

Depth Estimation

Monocular 3D Object Detection

Object

object-detection

Object Detection

Datasets

KITTI

Waymo Open Dataset

Results from the Paper

Edit

Ranked #2 on Monocular 3D Object Detection on KITTI Cyclist Moderate

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Monocular 3D Object Detection	KITTI Cars Easy	CaDDN	AP Easy	19.17	# 4	Compare
Monocular 3D Object Detection	KITTI Cars Hard	CaDDN	AP Hard	11.46	# 4	Compare
Monocular 3D Object Detection	KITTI Cars Moderate	CaDDN	AP Medium	13.41	# 11	Compare
Monocular 3D Object Detection	KITTI Cyclist Easy	CaDDN	AP Easy	7.00	# 2	Compare
Monocular 3D Object Detection	KITTI Cyclist Hard	CaDDN	AP Hard	3.30	# 2	Compare
Monocular 3D Object Detection	KITTI Cyclist Moderate	CaDDN	AP Medium	3.41	# 2	Compare
Monocular 3D Object Detection	KITTI Pedestrian Easy	CaDDN	AP Easy	12.87	# 3	Compare
Monocular 3D Object Detection	KITTI Pedestrian Hard	CaDDN	AP Hard	6.76	# 3	Compare
Monocular 3D Object Detection	KITTI Pedestrian Moderate	CaDDN	AP Medium	8.14	# 3	Compare

Methods

Add Remove

1x1 Convolution • ASPP • Batch Normalization • DeepLabv3 • Dilated Convolution • Spatial Pyramid Pooling

Edit Social Preview

Categorical Depth Distribution Network for Monocular 3D Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove