TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular Depth Estimation	DDAD	TransDSSL	absolute relative error	0.151	# 3
Monocular Depth Estimation	DDAD	TransDSSL	Sq Rel	3.591	# 3
Monocular Depth Estimation	DDAD	TransDSSL	RMSE	14.350	# 3
Monocular Depth Estimation	DDAD	TransDSSL	RMSE log	0.172	# 2
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	absolute relative error	0.095	# 7
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	RMSE	4.321	# 10
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	Sq Rel	0.711	# 14
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	RMSE log	0.172	# 6
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	Delta < 1.25	0.906	# 6
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	Delta < 1.25^2	0.967	# 7
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	Delta < 1.25^3	0.984	# 5
Monocular Depth Estimation	KITTI Eigen split unsupervised	TransDSSL	Mono	O	# 1
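
The table above reports error and accuracy metrics without stating their formulas. The sketch below shows how these quantities are conventionally defined in depth-estimation benchmarks. It assumes ground-truth and predicted depths arrive as flat lists of positive values in metres. The function name `depth_metrics` is hypothetical and is not taken from the TransDSSL repository. Evaluation details such as depth caps, cropping, and median scaling of predictions are omitted, so this is not a reproduction of the paper's protocol.

```python
import math


def depth_metrics(gt, pred):
    """Conventional depth metrics over paired, strictly positive depth values."""
    if len(gt) != len(pred) or len(gt) == 0:
        raise ValueError("gt and pred must be non-empty and of equal length")
    if any(g <= 0 for g in gt) or any(p <= 0 for p in pred):
        raise ValueError("depth values must be strictly positive")

    n = len(gt)
    pairs = list(zip(gt, pred))

    # Error metrics: lower is better.
    abs_rel = sum(abs(g - p) / g for g, p in pairs) / n
    sq_rel = sum((g - p) ** 2 / g for g, p in pairs) / n
    rmse = math.sqrt(sum((g - p) ** 2 for g, p in pairs) / n)
    rmse_log = math.sqrt(sum((math.log(g) - math.log(p)) ** 2 for g, p in pairs) / n)

    # Accuracy metrics: fraction of pixels whose ratio max(gt/pred, pred/gt)
    # falls below 1.25, 1.25^2, and 1.25^3. Higher is better.
    ratios = [max(g / p, p / g) for g, p in pairs]
    delta = {k: sum(1 for r in ratios if r < 1.25 ** k) / n for k in (1, 2, 3)}

    return {
        "absolute relative error": abs_rel,
        "Sq Rel": sq_rel,
        "RMSE": rmse,
        "RMSE log": rmse_log,
        "Delta < 1.25": delta[1],
        "Delta < 1.25^2": delta[2],
        "Delta < 1.25^3": delta[3],
    }
```

Calling `depth_metrics([10.0, 20.0, 40.0], [10.0, 22.0, 30.0])` returns a dictionary keyed by the metric names used in the table. In practice the inputs would be the valid pixels of a depth map, flattened after masking out pixels with no ground truth.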

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/transdssl-transformer-based-depth-estimation/monocular-depth-estimation-on-ddad)](https://paperswithcode.com/sota/monocular-depth-estimation-on-ddad?p=transdssl-transformer-based-depth-estimation)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/transdssl-transformer-based-depth-estimation/monocular-depth-estimation-on-kitti-eigen-1)](https://paperswithcode.com/sota/monocular-depth-estimation-on-kitti-eigen-1?p=transdssl-transformer-based-depth-estimation)`

TransDSSL: Transformer based Depth Estimation via Self-Supervised Learning

journal 2022 · Daechan Han, Jeongmin Shin, Namil Kim, Soomnim Hwang, Yukyung Choi

Recently, transformers have been widely adopted for various computer vision tasks and show promising results due to their ability to encode long-range spatial dependencies in an image effectively. However, very few studies have explored transformers for self-supervised depth estimation. When the CNN architecture is replaced with a transformer in self-supervised learning of depth, several problems arise, such as a multi-scale photometric loss function that becomes problematic when used with transformers and an insufficient ability to capture local details. In this paper, we propose an attention-based decoder module, Pixel-Wise Skip Attention (PWSA), to enhance fine details in feature maps while keeping global context from transformers. In addition, we propose utilizing self-distillation loss with single-scale photometric loss to alleviate the instability of transformer training by using correct training signals. We demonstrate that the proposed model performs accurate predictions on large objects and thin structures that require global context and local details. Our model achieves state-of-the-art performance among the self-supervised monocular depth estimation methods on KITTI and DDAD benchmarks.


Code


sejong-rcv/2021.Paper.TransDSSL official

Tasks


Decoder

Depth Estimation

Monocular Depth Estimation

Self-Supervised Learning

Unsupervised Monocular Depth Estimation

Datasets

KITTI

DDAD

Results from the Paper


Ranked #3 on Monocular Depth Estimation on DDAD

