Hybrid Task Cascade for Instance Segmentation

Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question. A simple combination of Cascade R-CNN and Mask R-CNN only brings limited gain. In exploring a more effective approach, we find that the key to a successful instance segmentation cascade is to fully leverage the reciprocal relationship between detection and segmentation. In this work, we propose a new framework, Hybrid Task Cascade (HTC), which differs in two important aspects: (1) instead of performing cascaded refinement on these two tasks separately, it interweaves them for a joint multi-stage processing; (2) it adopts a fully convolutional branch to provide spatial context, which can help distinguishing hard foreground from cluttered background. Overall, this framework can learn more discriminative features progressively while integrating complementary features together in each stage. Without bells and whistles, a single HTC obtains 38.4 and 1.5 improvement over a strong Cascade Mask R-CNN baseline on MSCOCO dataset. Moreover, our overall system achieves 48.6 mask AP on the test-challenge split, ranking 1st in the COCO 2018 Challenge Object Detection Task. Code is available at: https://github.com/open-mmlab/mmdetection.

PDF Abstract CVPR 2019 PDF CVPR 2019 Abstract

Datasets


Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Object Detection COCO minival HTC (cascade) box AP 43.2 # 129
AP50 59.4 # 82
AP75 40.7 # 87
APS 20.3 # 74
APM 40.9 # 73
APL 52.3 # 67
Instance Segmentation COCO minival HTC (ResNet-50) mask AP 38.2 # 77
Object Detection COCO-O HTC (ResNet-50) Average mAP 19.1 # 32
Effective Robustness 0.08 # 35
Object Detection COCO test-dev HTC (ResNeXt-101-FPN) box mAP 47.1 # 115
AP50 63.9 # 95
AP75 44.7 # 125
APS 22.8 # 114
APM 43.9 # 118
APL 54.6 # 105
Hardware Burden None # 1
Operations per network pass None # 1
Instance Segmentation COCO test-dev HTC + ResNeXt-101-FPN mask AP 41.2% # 61
Instance Segmentation COCO test-dev HTC + ResNeXt-101-FPN + DCN mask AP 41.2 # 61

Methods