TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Generation	ObjectsRoom	GENESIS-V2	FID	52.6	# 1
Unsupervised Object Segmentation	ObjectsRoom	GENESIS	ARI-FG	0.63	# 4
Image Generation	ObjectsRoom	GENESIS	FID	62.8	# 2
Unsupervised Object Segmentation	ObjectsRoom	SlotAttention	ARI-FG	0.79	# 3
Image Generation	ObjectsRoom	MONET-G	FID	205.7	# 3
Unsupervised Object Segmentation	ObjectsRoom	GENESIS-V2	ARI-FG	0.84	# 2
Unsupervised Object Segmentation	ObjectsRoom	MONET-G	ARI-FG	0.54	# 5
Unsupervised Object Segmentation	ShapeStacks	GENESIS	ARI-FG	0.70	# 4
Unsupervised Object Segmentation	ShapeStacks	GENESIS-V2	ARI-FG	0.81	# 2
Image Generation	ShapeStacks	GENESIS-V2	FID	112.7	# 1
Image Generation	ShapeStacks	GENESIS	FID	186.8	# 2
Image Generation	ShapeStacks	MONET-G	FID	197.8	# 3
Unsupervised Object Segmentation	ShapeStacks	MONET-G	ARI-FG	0.70	# 4
Unsupervised Object Segmentation	ShapeStacks	SlotAttention	ARI-FG	0.76	# 3
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	SlotAttention	ARI	0.03	# 4
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	GENESIS-V2	ARI	0.55	# 1
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	MONET-G	ARI	0.11	# 2
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	GENESIS	ARI	0.04	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/genesis-v2-inferring-unordered-object/image-generation-on-objectsroom)](https://paperswithcode.com/sota/image-generation-on-objectsroom?p=genesis-v2-inferring-unordered-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/genesis-v2-inferring-unordered-object/image-generation-on-shapestacks)](https://paperswithcode.com/sota/image-generation-on-shapestacks?p=genesis-v2-inferring-unordered-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/genesis-v2-inferring-unordered-object/unsupervised-object-segmentation-on-shelf)](https://paperswithcode.com/sota/unsupervised-object-segmentation-on-shelf?p=genesis-v2-inferring-unordered-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/genesis-v2-inferring-unordered-object/unsupervised-object-segmentation-on-1)](https://paperswithcode.com/sota/unsupervised-object-segmentation-on-1?p=genesis-v2-inferring-unordered-object)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/genesis-v2-inferring-unordered-object/unsupervised-object-segmentation-on)](https://paperswithcode.com/sota/unsupervised-object-segmentation-on?p=genesis-v2-inferring-unordered-object)`

GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

NeurIPS 2021 · Martin Engelcke, Oiwi Parker Jones, Ingmar Posner ·

Advances in unsupervised learning of object-representations have culminated in the development of a broad range of methods for unsupervised object segmentation and interpretable object-centric scene generation. These methods, however, are limited to simulated and real-world datasets with limited visual complexity. Moreover, object representations are often inferred using RNNs which do not scale well to large images or iterative refinement which avoids imposing an unnatural ordering on objects in an image but requires the a priori initialisation of a fixed number of object representations. In contrast to established paradigms, this work proposes an embedding-based approach in which embeddings of pixels are clustered in a differentiable fashion using a stochastic stick-breaking process. Similar to iterative refinement, this clustering procedure also leads to randomly ordered object representations, but without the need of initialising a fixed number of clusters a priori. This is used to develop a new model, GENESIS-v2, which can infer a variable number of object representations without using RNNs or iterative refinement. We show that GENESIS-v2 performs strongly in comparison to recent baselines in terms of unsupervised image segmentation and object-centric scene generation on established synthetic datasets as well as more complex real-world datasets.

PDF Abstract NeurIPS 2021 PDF NeurIPS 2021 Abstract

Code

Add Remove Mark official

applied-ai-lab/genesis official

jinyangyuan/genesis

Tasks

Add Remove

Clustering

Image Generation

Image Segmentation

Object

Representation Learning

Scene Generation

Unsupervised Image Segmentation

Unsupervised Object Segmentation

Datasets

ShapeStacks ObjectsRoom Sketchy Shelf&Tote Training Dataset

Results from the Paper

Edit

Ranked #1 on Unsupervised Object Segmentation on Shelf&Tote Training Dataset

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image Generation	ObjectsRoom	GENESIS-V2	FID	52.6	# 1	Compare
Unsupervised Object Segmentation	ObjectsRoom	GENESIS	ARI-FG	0.63	# 4	Compare
Image Generation	ObjectsRoom	GENESIS	FID	62.8	# 2	Compare
Unsupervised Object Segmentation	ObjectsRoom	SlotAttention	ARI-FG	0.79	# 3	Compare
Image Generation	ObjectsRoom	MONET-G	FID	205.7	# 3	Compare
Unsupervised Object Segmentation	ObjectsRoom	GENESIS-V2	ARI-FG	0.84	# 2	Compare
Unsupervised Object Segmentation	ObjectsRoom	MONET-G	ARI-FG	0.54	# 5	Compare
Unsupervised Object Segmentation	ShapeStacks	GENESIS	ARI-FG	0.70	# 4	Compare
Unsupervised Object Segmentation	ShapeStacks	GENESIS-V2	ARI-FG	0.81	# 2	Compare
Image Generation	ShapeStacks	GENESIS-V2	FID	112.7	# 1	Compare
Image Generation	ShapeStacks	GENESIS	FID	186.8	# 2	Compare
Image Generation	ShapeStacks	MONET-G	FID	197.8	# 3	Compare
Unsupervised Object Segmentation	ShapeStacks	MONET-G	ARI-FG	0.70	# 4	Compare
Unsupervised Object Segmentation	ShapeStacks	SlotAttention	ARI-FG	0.76	# 3	Compare
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	SlotAttention	ARI	0.03	# 4	Compare
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	GENESIS-V2	ARI	0.55	# 1	Compare
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	MONET-G	ARI	0.11	# 2	Compare
Unsupervised Object Segmentation	Shelf&Tote Training Dataset	GENESIS	ARI	0.04	# 3	Compare

Methods

Add Remove

Adam • Concatenated Skip Connection • Convolution • GECO • Group Normalization • IC-SBP • LSTM • Max Pooling • ReLU • Sigmoid Activation • Tanh Activation • Transposed convolution • U-Net • VAE

Edit Social Preview

GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove