TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Self-Supervised Action Recognition	HMDB51	pBYOL	Top-1 Accuracy	75.0	# 3
Self-Supervised Action Recognition	HMDB51	pBYOL	Pre-Training Dataset	Kinetics400	# 1
Self-Supervised Action Recognition	HMDB51	pBYOL	Frozen	false	# 1
Self-Supervised Action Recognition	UCF101	pBYOL	3-fold Accuracy	96.3	# 5
Self-Supervised Action Recognition	UCF101	pBYOL	Pre-Training Dataset	Kinetics400	# 1
Self-Supervised Action Recognition	UCF101	pBYOL	Frozen	false	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-large-scale-study-on-unsupervised/self-supervised-action-recognition-on-hmdb51)](https://paperswithcode.com/sota/self-supervised-action-recognition-on-hmdb51?p=a-large-scale-study-on-unsupervised)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/a-large-scale-study-on-unsupervised/self-supervised-action-recognition-on-ucf101)](https://paperswithcode.com/sota/self-supervised-action-recognition-on-ucf101?p=a-large-scale-study-on-unsupervised)`

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

CVPR 2021 · Christoph Feichtenhofer, Haoqi Fan, Bo Xiong, Ross Girshick, Kaiming He ·

We present a large-scale study on unsupervised spatiotemporal representation learning from videos. With a unified perspective on four recent image-based frameworks, we study a simple objective that can easily generalize all these methods to space-time. Our objective encourages temporally-persistent features in the same video, and in spite of its simplicity, it works surprisingly well across: (i) different unsupervised frameworks, (ii) pre-training datasets, (iii) downstream datasets, and (iv) backbone architectures. We draw a series of intriguing observations from this study, e.g., we discover that encouraging long-spanned persistency can be effective even if the timespan is 60 seconds. In addition to state-of-the-art results in multiple benchmarks, we report a few promising cases in which unsupervised pre-training can outperform its supervised counterpart. Code is made available at https://github.com/facebookresearch/SlowFast

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract

Code

Add Remove Mark official

facebookresearch/SlowFast official

6,286

seleucia/goca

Tasks

Add Remove

Representation Learning

Self-Supervised Action Recognition

Unsupervised Pre-training

Datasets

UCF101

Kinetics

HMDB51

Kinetics 400

Charades

Something-Something V2

Kinetics-700

Results from the Paper

Edit

Ranked #3 on Self-Supervised Action Recognition on HMDB51

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Self-Supervised Action Recognition	HMDB51	pBYOL	Top-1 Accuracy	75.0	# 3	Compare
			Pre-Training Dataset	Kinetics400	# 1	Compare
			Frozen	false	# 1	Compare
Self-Supervised Action Recognition	UCF101	pBYOL	3-fold Accuracy	96.3	# 5	Compare
			Pre-Training Dataset	Kinetics400	# 1	Compare
			Frozen	false	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove