SimVP: Towards Simple yet Powerful Spatiotemporal Predictive Learning

22 Nov 2022 · Cheng Tan, Zhangyang Gao, Siyuan Li, Stan Z. Li

Recent years have witnessed remarkable advances in spatiotemporal predictive learning, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated training strategies. Although impressive, the system complexity of mainstream methods keeps increasing as well, which may hinder convenient application in practice. This paper proposes SimVP, a simple spatiotemporal predictive baseline model that is built entirely upon convolutional networks without recurrent architectures and is trained end-to-end with a plain mean squared error loss. Without introducing any extra tricks or strategies, SimVP achieves superior performance on various benchmark datasets. To further improve performance, we derive variants of SimVP equipped with a gated spatiotemporal attention translator. Through extensive experiments, we demonstrate that SimVP generalizes well and extends readily to real-world datasets. Its significantly reduced training cost makes it easier to scale to complex scenarios. We believe SimVP can serve as a solid baseline to benefit the spatiotemporal predictive learning community.
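
To make the described design concrete, below is a minimal sketch of the encoder-translator-decoder idea from the abstract: a frame-wise convolutional encoder, a purely convolutional temporal translator operating over stacked frame features, a convolutional decoder, and a plain MSE training objective. The class name TinySimVP, the layer widths, and the GroupNorm/SiLU choices are illustrative assumptions, not the authors' exact implementation.

```python
import torch
import torch.nn as nn


class ConvBlock(nn.Module):
    """Conv + GroupNorm + activation; the basic convolutional unit of this sketch."""
    def __init__(self, in_ch, out_ch, stride=1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=stride, padding=1),
            nn.GroupNorm(8, out_ch),
            nn.SiLU(),
        )

    def forward(self, x):
        return self.net(x)


class TinySimVP(nn.Module):
    """Hypothetical SimVP-style model: spatial encoder, temporal translator, spatial decoder,
    all built from plain 2D convolutions (no recurrence)."""
    def __init__(self, t_in=10, t_out=10, channels=1, hid=64):
        super().__init__()
        self.t_in, self.t_out, self.channels = t_in, t_out, channels
        # Spatial encoder: applied frame by frame, downsamples resolution by 2.
        self.encoder = nn.Sequential(
            ConvBlock(channels, hid),
            ConvBlock(hid, hid, stride=2),
        )
        # Temporal translator: convolutions over (time * hidden) channels stacked together.
        self.translator = nn.Sequential(
            ConvBlock(t_in * hid, t_out * hid),
            ConvBlock(t_out * hid, t_out * hid),
        )
        # Spatial decoder: upsamples back to the input resolution, frame by frame.
        self.decoder = nn.Sequential(
            nn.Upsample(scale_factor=2, mode="nearest"),
            ConvBlock(hid, hid),
            nn.Conv2d(hid, channels, kernel_size=3, padding=1),
        )

    def forward(self, x):  # x: (B, T_in, C, H, W)
        b, t, c, h, w = x.shape
        z = self.encoder(x.reshape(b * t, c, h, w))       # frame-wise spatial encoding
        _, hid, h2, w2 = z.shape
        z = z.reshape(b, t * hid, h2, w2)                  # stack time into channels
        z = self.translator(z)                             # model temporal evolution with convs
        z = z.reshape(b * self.t_out, hid, h2, w2)
        y = self.decoder(z)                                # frame-wise spatial decoding
        return y.reshape(b, self.t_out, c, h, w)


if __name__ == "__main__":
    model = TinySimVP()
    frames_in = torch.randn(2, 10, 1, 64, 64)   # e.g. Moving MNIST-sized clips
    frames_gt = torch.randn(2, 10, 1, 64, 64)
    pred = model(frames_in)
    loss = nn.MSELoss()(pred, frames_gt)         # plain end-to-end MSE objective
    loss.backward()
    print(pred.shape, float(loss))
```

The gated spatiotemporal attention (gSTA) variants mentioned in the abstract replace the plain convolutional translator above with an attention-augmented one; the sketch keeps the simpler all-convolutional form described for the baseline.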

Results from the Paper


Task              Dataset        Model             Metric   Value    Global Rank
Video Prediction  Moving MNIST   SimVP+gSTA-Sx10   MSE      15.05    #1
                                                   MAE      49.8     #2
                                                   SSIM     0.967    #2

Methods


No methods listed for this paper.