RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes

CVPR 2022 · Tak-Wai Hui

Unsupervised methods have shown promising results on monocular depth estimation. However, the training data must be captured in scenes without moving objects. To push the envelope of accuracy, recent methods tend to increase the number of model parameters. In this paper, an unsupervised learning framework is proposed to jointly predict monocular depth and the complete 3D motion, including the motions of moving objects and the camera. (1) Recurrent modulation units adaptively and iteratively fuse encoder and decoder features. This improves single-image depth inference without inflating the parameter count. (2) Instead of a single set of filters for upsampling, multiple sets of filters are devised for residual upsampling. This facilitates the learning of edge-preserving filters and leads to improved performance. (3) A warping-based network estimates the motion field of moving objects without using semantic priors. This removes the scene-rigidity requirement and allows general videos to be used for unsupervised learning. The motion field is further regularized by an outlier-aware training loss. Although the depth model uses only a single image at test time and just 2.97M parameters, it achieves state-of-the-art results on the KITTI and Cityscapes benchmarks.
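A minimal sketch, assuming a GRU-style gate, of how a recurrent modulation unit might fuse an encoder feature with the current decoder state for contribution (1); the class name, channel arguments, and gating form are illustrative assumptions rather than the authors' exact design.

import torch
import torch.nn as nn

class RecurrentModulationUnit(nn.Module):
    # Hypothetical recurrent modulation unit: a convolutional GRU-style gate
    # that iteratively modulates an encoder feature with decoder information.
    def __init__(self, enc_channels, dec_channels):
        super().__init__()
        self.update_gate = nn.Conv2d(enc_channels + dec_channels, enc_channels, 3, padding=1)
        self.candidate = nn.Conv2d(enc_channels + dec_channels, enc_channels, 3, padding=1)

    def forward(self, enc_feat, dec_feat):
        # dec_feat is assumed to be upsampled to the encoder resolution beforehand
        x = torch.cat([enc_feat, dec_feat], dim=1)
        z = torch.sigmoid(self.update_gate(x))   # how strongly to modulate
        h = torch.tanh(self.candidate(x))        # candidate fused feature
        return (1.0 - z) * enc_feat + z * h      # modulated (fused) encoder feature

Reusing such a unit across decoder iterations lets the same small set of weights refine the fused features, which is one way depth inference can improve without a large increase in model size.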


Datasets

KITTI · Cityscapes

Task: Unsupervised Monocular Depth Estimation
Dataset: Cityscapes
Model: RM-Depth

Metric                             Value    Global Rank
RMSE                               5.503    # 1
RMSE log                           0.143    # 1
Square relative error (SqRel)      0.825    # 1
Absolute relative error (AbsRel)   0.09     # 1
Test frames                        1        # 1
