Efficient Dense Modules of Asymmetric Convolution for Real-Time Semantic Segmentation

17 Sep 2018  ·  Shao-Yuan Lo, Hsueh-Ming Hang, Sheng-Wei Chan, Jing-Jhih Lin ·

Real-time semantic segmentation plays an important role in practical applications such as self-driving and robots. Most semantic segmentation research focuses on improving estimation accuracy with little consideration on efficiency. Several previous studies that emphasize high-speed inference often fail to produce high-accuracy segmentation results. In this paper, we propose a novel convolutional network named Efficient Dense modules with Asymmetric convolution (EDANet), which employs an asymmetric convolution structure and incorporates dilated convolution and dense connectivity to achieve high efficiency at low computational cost and model size. EDANet is 2.7 times faster than the existing fast segmentation network, ICNet, while it achieves a similar mIoU score without any additional context module, post-processing scheme, and pretrained model. We evaluate EDANet on Cityscapes and CamVid datasets, and compare it with the other state-of-art systems. Our network can run with the high-resolution inputs at the speed of 108 FPS on one GTX 1080Ti.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Semantic Segmentation CamVid EDANet Global Accuracy 90.8 # 2
Mean IoU 66.4 # 10
Real-Time Semantic Segmentation CamVid EDANet mIoU 66.4 # 22
Semantic Segmentation Cityscapes test EDANet Mean IoU (class) 67.3 # 87
Real-Time Semantic Segmentation Cityscapes test EDANet mIoU 67.3 # 34
Time (ms) 9.2 # 3
Frame (fps) 108.7 (1080Ti) # 5

Methods