In object counting, objects often exhibit different sizes at different scales, even if they have similar physical sizes in reality. This is particularly true when targeting crowd counting and vehicle… Click to show full abstract
In object counting, objects often exhibit different sizes at different scales, even if they have similar physical sizes in reality. This is particularly true when targeting crowd counting and vehicle counting in intelligent transportation. Failing to model such variations leads to the mismatch between the object size and image scale. To address this problem, existing methods often extract multi-scale features, but they either still generate the single-scale prediction or lack an explicit suppression mechanism to eliminate predictions engendered by inappropriate scales. Our scale analysis manifests that, the single-scale estimation only works well for objects of certain sizes, and a suppression operator is required to isolate the estimation of a specific scale. In this work, we propose a scale-aware counting network termed NSSNet. NSSNet has two key features: it not only i) generates multi-scale predictions but also ii) applies a novel non-scale suppression (NSS) operator to suppress scale-mismatched estimations. NSS is inspired by the widely-used non-maximum suppression (NMS). In contrast to NMS that only reserves the maximum response, NSS filters out those clearly wrong predictions (the remaining predictions may still be from multiple scales). We evaluate NSSNet on four standard crowd and vehicle counting benchmarks and report state-of-the-art performance. We also show the scale adaptability of NSSNet through a controlled multi-scale experiment. Code and pretrained models are available at https://git.io/nssnet.
               
Click one of the above tabs to view related content.