TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Domain Adaptation	GTA5 to Cityscapes	TransDA-B	mIoU	63.9	# 11
Synthetic-to-Real Translation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 12
Unsupervised Domain Adaptation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 11
Semantic Segmentation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 7
Image-to-Image Translation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 9
Image-to-Image Translation	SYNTHIA-to-Cityscapes	TransDA-B	mIoU (13 classes)	66.3	# 8
Unsupervised Domain Adaptation	SYNTHIA-to-Cityscapes	TransDA-B	mIoU (13 classes)	66.3	# 10
Synthetic-to-Real Translation	SYNTHIA-to-Cityscapes	TransDA-B	MIoU (13 classes)	66.3	# 10
Synthetic-to-Real Translation	SYNTHIA-to-Cityscapes	TransDA-B	MIoU (16 classes)	59.3	# 10
Semantic Segmentation	SYNTHIA-to-Cityscapes	TransDA-B	Mean IoU	59.3	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/semantic-segmentation-on-gtav-to-cityscapes-1)](https://paperswithcode.com/sota/semantic-segmentation-on-gtav-to-cityscapes-1?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/semantic-segmentation-on-synthia-to)](https://paperswithcode.com/sota/semantic-segmentation-on-synthia-to?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/image-to-image-translation-on-synthia-to)](https://paperswithcode.com/sota/image-to-image-translation-on-synthia-to?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/image-to-image-translation-on-gtav-to)](https://paperswithcode.com/sota/image-to-image-translation-on-gtav-to?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/unsupervised-domain-adaptation-on-synthia-to)](https://paperswithcode.com/sota/unsupervised-domain-adaptation-on-synthia-to?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/synthetic-to-real-translation-on-synthia-to-1)](https://paperswithcode.com/sota/synthetic-to-real-translation-on-synthia-to-1?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/domain-adaptation-on-gta5-to-cityscapes)](https://paperswithcode.com/sota/domain-adaptation-on-gta5-to-cityscapes?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/unsupervised-domain-adaptation-on-gtav-to)](https://paperswithcode.com/sota/unsupervised-domain-adaptation-on-gtav-to?p=smoothing-matters-momentum-transformer-for)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/smoothing-matters-momentum-transformer-for/synthetic-to-real-translation-on-gtav-to)](https://paperswithcode.com/sota/synthetic-to-real-translation-on-gtav-to?p=smoothing-matters-momentum-transformer-for)`

Smoothing Matters: Momentum Transformer for Domain Adaptive Semantic Segmentation

15 Mar 2022 · Runfa Chen, Yu Rong, Shangmin Guo, Jiaqi Han, Fuchun Sun, Tingyang Xu, Wenbing Huang ·

After the great success of Vision Transformer variants (ViTs) in computer vision, it has also demonstrated great potential in domain adaptive semantic segmentation. Unfortunately, straightforwardly applying local ViTs in domain adaptive semantic segmentation does not bring in expected improvement. We find that the pitfall of local ViTs is due to the severe high-frequency components generated during both the pseudo-label construction and features alignment for target domains. These high-frequency components make the training of local ViTs very unsmooth and hurt their transferability. In this paper, we introduce a low-pass filtering mechanism, momentum network, to smooth the learning dynamics of target domain features and pseudo labels. Furthermore, we propose a dynamic of discrepancy measurement to align the distributions in the source and target domains via dynamic weights to evaluate the importance of the samples. After tackling the above issues, extensive experiments on sim2real benchmarks show that the proposed method outperforms the state-of-the-art methods. Our codes are available at https://github.com/alpc91/TransDA

PDF Abstract

Code

Add Remove Mark official

alpc91/transda official

Tasks

Add Remove

Domain Adaptation

Image-to-Image Translation

Pseudo Label

Segmentation

Semantic Segmentation

Synthetic-to-Real Translation

Unsupervised Domain Adaptation

Datasets

Cityscapes

SYNTHIA

GTA5

Results from the Paper

Edit

Ranked #7 on Semantic Segmentation on SYNTHIA-to-Cityscapes

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Domain Adaptation	GTA5 to Cityscapes	TransDA-B	mIoU	63.9	# 11	Compare
Synthetic-to-Real Translation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 12	Compare
Unsupervised Domain Adaptation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 11	Compare
Semantic Segmentation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 7	Compare
Image-to-Image Translation	GTAV-to-Cityscapes Labels	TransDA-B	mIoU	63.9	# 9	Compare
Image-to-Image Translation	SYNTHIA-to-Cityscapes	TransDA-B	mIoU (13 classes)	66.3	# 8	Compare
Unsupervised Domain Adaptation	SYNTHIA-to-Cityscapes	TransDA-B	mIoU (13 classes)	66.3	# 10	Compare
Synthetic-to-Real Translation	SYNTHIA-to-Cityscapes	TransDA-B	MIoU (13 classes)	66.3	# 10	Compare
Synthetic-to-Real Translation	SYNTHIA-to-Cityscapes	TransDA-B	MIoU (16 classes)	59.3	# 10	Compare
Semantic Segmentation	SYNTHIA-to-Cityscapes	TransDA-B	Mean IoU	59.3	# 7	Compare

Methods

Add Remove

Absolute Position Encodings • Adam • BPE • Dense Connections • Dropout • Label Smoothing • Layer Normalization • Linear Layer • Multi-Head Attention • Position-Wise Feed-Forward Layer • Residual Connection • Scaled Dot-Product Attention • Softmax • Transformer • Vision Transformer

Edit Social Preview

Smoothing Matters: Momentum Transformer for Domain Adaptive Semantic Segmentation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove