TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	box mAP	48.2	# 93
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	AP50	67.4	# 50
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	AP75	52.6	# 47
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	APS	29.2	# 52
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	APM	51.7	# 42
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	APL	60.2	# 49
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	Hardware Burden	None	# 1
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	Operations per network pass	None	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/generalized-focal-loss-learning-qualified-and/object-detection-on-coco)](https://paperswithcode.com/sota/object-detection-on-coco?p=generalized-focal-loss-learning-qualified-and)`

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection

NeurIPS 2020 · Xiang Li, Wenhai Wang, Lijun Wu, Shuo Chen, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang ·

One-stage detector basically formulates object detection as dense classification and localization. The classification is usually optimized by Focal Loss and the box location is commonly learned under Dirac delta distribution. A recent trend for one-stage detectors is to introduce an individual prediction branch to estimate the quality of localization, where the predicted quality facilitates the classification to improve detection performance. This paper delves into the representations of the above three fundamental elements: quality estimation, classification and localization. Two problems are discovered in existing practices, including (1) the inconsistent usage of the quality estimation and classification between training and inference and (2) the inflexible Dirac delta distribution for localization when there is ambiguity and uncertainty in complex scenes. To address the problems, we design new representations for these elements. Specifically, we merge the quality estimation into the class prediction vector to form a joint representation of localization quality and classification, and use a vector to represent arbitrary distribution of box locations. The improved representations eliminate the inconsistency risk and accurately depict the flexible distribution in real data, but contain continuous labels, which is beyond the scope of Focal Loss. We then propose Generalized Focal Loss (GFL) that generalizes Focal Loss from its discrete form to the continuous version for successful optimization. On COCO test-dev, GFL achieves 45.0\% AP using ResNet-101 backbone, surpassing state-of-the-art SAPD (43.5\%) and ATSS (43.6\%) with higher or comparable inference speed, under the same backbone and training settings. Notably, our best model can achieve a single-model single-scale AP of 48.2\%, at 10 FPS on a single 2080Ti GPU. Code and models are available at https://github.com/implus/GFocal.

PDF Abstract NeurIPS 2020 PDF NeurIPS 2020 Abstract

Code

Add Remove Mark official

implus/GFocal official

563

open-mmlab/mmdetection

27,708

PaddlePaddle/PaddleDetection

12,029

RangiLyu/nanodet

5,530

Yuxiang1995/ICDAR2021_MFD

118

See all 7 implementations

Tasks

Add Remove

Dense Object Detection

General Classification

Object Detection

Datasets

MS COCO

Results from the Paper

Edit

Ranked #93 on Object Detection on COCO test-dev

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Object Detection	COCO test-dev	GFL (X-101-32x4d-DCN, single-scale)	box mAP	48.2	# 93	Compare
			AP50	67.4	# 50	Compare
			AP75	52.6	# 47	Compare
			APS	29.2	# 52	Compare
			APM	51.7	# 42	Compare
			APL	60.2	# 49	Compare
			Hardware Burden	None	# 1	Compare
			Operations per network pass	None	# 1	Compare

Methods

Add Remove

1x1 Convolution • ATSS • Average Pooling • Batch Normalization • Convolution • Deformable Convolution • Focal Loss • Generalized Focal Loss • Global Average Pooling • Grouped Convolution • Kaiming Initialization • ReLU • Residual Connection • ResNeXt • ResNeXt Block

Edit Social Preview

Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove