In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation

1 Jun 2023 · Julian Bitterwolf, Maximilian Müller, Matthias Hein

Out-of-distribution (OOD) detection is the problem of identifying inputs which are unrelated to the in-distribution task. When the in-distribution (ID) is ImageNet-1K, OOD detection performance is commonly tested on a small range of test OOD datasets. We find that most of the currently used test OOD datasets, including datasets from the open set recognition (OSR) literature, have severe issues: in some cases more than 50% of the dataset contains objects belonging to one of the ID classes. These erroneous samples heavily distort the evaluation of OOD detectors. As a solution, we introduce NINCO, a novel test OOD dataset in which every sample has been checked to be free of ID objects. Its fine-grained range of OOD classes allows for a detailed analysis of an OOD detector's strengths and failure modes, particularly when paired with a number of synthetic "OOD unit-tests". We provide detailed evaluations across a large set of architectures and OOD detection methods on NINCO and the unit-tests, revealing new insights about model weaknesses and the effects of pretraining on OOD detection performance. We provide code and data at https://github.com/j-cb/NINCO.
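For concreteness, here is a minimal sketch of the two evaluation metrics reported in the results below (AUROC and FPR@95), assuming a detector that assigns higher scores to ID inputs. The arrays `id_scores` and `ood_scores` are hypothetical detector outputs (e.g. maximum softmax probabilities) on ImageNet-1k and NINCO test images; this is not the paper's evaluation code.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def ood_metrics(id_scores: np.ndarray, ood_scores: np.ndarray):
    """AUROC and FPR@95 for scores where higher means 'more in-distribution'."""
    # AUROC of separating ID (label 1) from OOD (label 0) by the score.
    labels = np.concatenate([np.ones(len(id_scores)), np.zeros(len(ood_scores))])
    scores = np.concatenate([id_scores, ood_scores])
    auroc = roc_auc_score(labels, scores)

    # FPR@95: fraction of OOD samples still accepted as ID at the threshold
    # that keeps 95% of ID samples (i.e. 95% true positive rate on ID).
    threshold = np.percentile(id_scores, 5)  # 95% of ID scores lie above it
    fpr_at_95 = float(np.mean(ood_scores >= threshold))
    return auroc, fpr_at_95
```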


Results from the Paper


All results are for the task Out-of-Distribution Detection on the ImageNet-1k vs NINCO benchmark:

| Model | Detection Method | AUROC | Rank | FPR@95 | Rank |
|---|---|---|---|---|---|
| ViT-B-384 (pre-trained on IN-21k) | Mahalanobis | 95.0 | #1 | 27.5 | #1 |
| EffNetb7 | Relative Cosine Sim | 87.9 | #3 | 53.7 | #2 |
| EffNetv2-M | Relative Mahalanobis | 88.9 | #2 | 54.9 | #3 |
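
For orientation, a minimal sketch of how the Mahalanobis and Relative Mahalanobis scores named in the table are typically computed, following Lee et al. (2018) and Ren et al. (2021); the feature layer, preprocessing, and estimator details in the paper's evaluation may differ.

```python
import numpy as np

def fit_mahalanobis(train_feats: np.ndarray, train_labels: np.ndarray):
    """Class means and a shared precision matrix from ID training features."""
    classes = np.unique(train_labels)  # sorted unique class ids
    means = np.stack([train_feats[train_labels == c].mean(0) for c in classes])
    centered = train_feats - means[np.searchsorted(classes, train_labels)]
    precision = np.linalg.pinv(centered.T @ centered / len(train_feats))
    return means, precision

def mahalanobis_score(feats, means, precision):
    # Higher score = more ID: negative squared distance to the closest class mean.
    diffs = feats[:, None, :] - means[None, :, :]            # (N, C, D)
    d2 = np.einsum('ncd,de,nce->nc', diffs, precision, diffs)
    return -d2.min(axis=1)

def relative_mahalanobis_score(feats, means, precision, mean0, precision0):
    # Relative variant: subtract the distance under a single class-agnostic
    # Gaussian (mean0, precision0 fit on all ID features pooled together).
    d0 = np.einsum('nd,de,ne->n', feats - mean0, precision0, feats - mean0)
    return mahalanobis_score(feats, means, precision) + d0
```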
