Where are the Blobs: Counting by Localization with Point Supervision

Object counting is an important task in computer vision due to its growing demand in applications such as surveillance, traffic monitoring, and counting everyday objects. State-of-the-art methods use regression-based optimization where they explicitly learn to count the objects of interest. These often perform better than detection-based methods that need to learn the more difficult task of predicting the location, size, and shape of each object. However, we propose a detection-based method that does not need to estimate the size and shape of the objects and that outperforms regression-based methods. Our contributions are three-fold: (1) we propose a novel loss function that encourages the network to output a single blob per object instance using point-level annotations only; (2) we design two methods for splitting large predicted blobs between object instances; and (3) we show that our method achieves new state-of-the-art results on several challenging datasets including the Pascal VOC and the Penguins dataset. Our method even outperforms those that use stronger supervision such as depth features, multi-point annotations, and bounding-box labels.

PDF Abstract ECCV 2018 PDF ECCV 2018 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Object Counting COCO count-test LC-ResFCN m-reIRMSE 0.19 # 4
m-reIRMSE-nz 0.99 # 6
mRMSE 0.38 # 4
mRMSE-nz 2.20 # 5
Object Counting Pascal VOC 2007 count-test LC-ResFCN m-reIRMSE-nz 0.61 # 1
m-relRMSE 0.17 # 1
mRMSE 0.31 # 3
mRMSE-nz 1.20 # 3
Object Counting Pascal VOC 2007 count-test LC-PSPNet m-reIRMSE-nz 0.70 # 5
m-relRMSE 0.20 # 3
mRMSE 0.35 # 4
mRMSE-nz 1.32 # 4

Methods


No methods listed for this paper. Add relevant methods here