TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
6D Pose Estimation using RGBD	REAL275	FS-Net	mAP 10°, 10cm	64.6	# 3
6D Pose Estimation using RGBD	REAL275	FS-Net	mAP 10°, 5cm	60.8	# 7
6D Pose Estimation using RGBD	REAL275	FS-Net	mAP 3D IoU@25	95.1	# 2
6D Pose Estimation using RGBD	REAL275	FS-Net	mAP 3D IoU@50	92.2	# 1
6D Pose Estimation using RGBD	REAL275	FS-Net	mAP 5°, 5cm	28.2	# 9
6D Pose Estimation using RGBD	REAL275	FS-Net	mAP 3D IoU@75	63.5	# 3
6D Pose Estimation using RGBD	REAL275	FS-Net	FPS	20	# 1
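
The "n°, m cm" metrics above count a prediction as correct when its rotation error is at most n degrees and its translation error is at most m centimetres. The sketch below shows that per-instance check only. The function names are hypothetical, translations are assumed to be in metres, and the benchmark's full protocol (averaging into mAP, handling of symmetric objects, the 3D IoU metrics) is not reproduced here.

```python
import math


def rotation_error_deg(R_pred, R_gt):
    """Angle in degrees between two 3x3 rotation matrices (nested lists)."""
    # trace(R_pred^T R_gt) equals the element-wise product summed over all entries.
    trace = sum(R_pred[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))


def translation_error_cm(t_pred, t_gt):
    """Euclidean distance in centimetres; inputs are assumed to be in metres."""
    return 100.0 * math.dist(t_pred, t_gt)


def within_threshold(R_pred, t_pred, R_gt, t_gt, max_deg, max_cm):
    """True if the pose passes an 'n degrees, m cm' check."""
    return (rotation_error_deg(R_pred, R_gt) <= max_deg
            and translation_error_cm(t_pred, t_gt) <= max_cm)


def rot_z(deg):
    """Helper for the example: rotation about the z-axis by `deg` degrees."""
    a = math.radians(deg)
    c, s = math.cos(a), math.sin(a)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]
```

For example, a prediction that is off by 8 degrees and 3 cm would pass the 10°, 5cm check but fail the 5°, 5cm check, which is consistent with the stricter metric having the lower score in the table.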

FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism

CVPR 2021 · Wei Chen, Xi Jia, Hyung Jin Chang, Jinming Duan, Linlin Shen, Ales Leonardis

In this paper, we focus on category-level 6D pose and size estimation from a monocular RGB-D image. Previous methods suffer from inefficient category-level pose feature extraction, which leads to low accuracy and slow inference. To tackle this problem, we propose a fast shape-based network (FS-Net) with efficient category-level feature extraction for 6D pose estimation. First, we design an orientation-aware autoencoder with 3D graph convolution for latent feature extraction. The learned latent feature is insensitive to point shift and object size thanks to the shift- and scale-invariance properties of the 3D graph convolution. Then, to efficiently decode category-level rotation information from the latent feature, we propose a novel decoupled rotation mechanism that employs two decoders to complementarily access the rotation information. Meanwhile, we estimate translation and size via two residuals: the difference between the mean of the object points and the ground-truth translation, and the difference between the mean size of the category and the ground-truth size, respectively. Finally, to increase the generalization ability of FS-Net, we propose an online box-cage-based 3D deformation mechanism to augment the training data. Extensive experiments on two benchmark datasets show that the proposed method achieves state-of-the-art performance in both category- and instance-level 6D object pose estimation. In particular, for category-level pose estimation, without extra synthetic data, our method outperforms existing methods by 6.3% on the NOCS-REAL dataset.
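
The residual formulation of translation and size described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names are hypothetical, and the sign convention (residual = ground truth minus mean) is an assumption, since the abstract only says the residuals are differences between these quantities.

```python
def mean_point(points):
    """Centroid of a list of 3D points given as [x, y, z] lists."""
    n = len(points)
    return [sum(p[k] for p in points) / n for k in range(3)]


def residual_targets(points, gt_translation, gt_size, category_mean_size):
    """Training targets: offsets from the point centroid and the category mean size.

    Assumed convention: residual = ground truth - mean.
    """
    centroid = mean_point(points)
    t_res = [g - c for g, c in zip(gt_translation, centroid)]
    s_res = [g - m for g, m in zip(gt_size, category_mean_size)]
    return t_res, s_res


def recover_translation_and_size(points, pred_t_res, pred_s_res, category_mean_size):
    """Invert the residuals at inference time by adding the means back."""
    centroid = mean_point(points)
    translation = [c + r for c, r in zip(centroid, pred_t_res)]
    size = [m + r for m, r in zip(category_mean_size, pred_s_res)]
    return translation, size
```

Under this reading, the network only has to regress small offsets around quantities that are cheap to compute from the input (the centroid) or known per category (the mean size), rather than absolute values.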

Code

DC1991/FS-Net official

DC1991/FS_Net

Tasks

6D Pose Estimation

6D Pose Estimation using RGB

6D Pose Estimation using RGBD

Pose Estimation

Translation

Datasets

REAL275

Results from the Paper

Ranked #7 on 6D Pose Estimation using RGBD on REAL275

Methods

AutoEncoder • AWARE • Convolution
