Artificial visual attention has been an active research area for over two decades. Especially, the concept of saliency has been implemented in many different ways. Early approaches aimed at closely… Click to show full abstract
Artificial visual attention has been an active research area for over two decades. Especially, the concept of saliency has been implemented in many different ways. Early approaches aimed at closely modeling saliency processing with concepts from biological attention to provide (at least in the long run) general-purpose attention for technical systems. More recent approaches have departed from this agenda, turning to more specific attention-guided tasks, most notably the accurate extraction of salient objects, for which large-scale ground truth datasets make it possible to quantify progress. While the first type of models is troubled by weak performance in these specific tasks, the second type, as we show with a new benchmark, has lost the ability to predict saliency in the original sense, which may be an important factor for future general-purpose attention systems. Here, we describe a new approach using growing neural gas to obtain pre-attentional structures for a scene at an early processing stage. On this basis, traditional saliency concepts can be applied while at the same time they can be linked to mechanisms that make models successful in salient object detection. The model shows high performance at predicting traditional saliency and makes substantial progress toward salient object detection, although it cannot reach the top-level performance of some specialized methods. We discuss the important implications of our findings.
               
Click one of the above tabs to view related content.