Internet of Things (IoT) has brought changes in many fields by joining physical space with the cyber space. The IoT devices are becoming increasingly complex. With the rapid deployment of… Click to show full abstract
Internet of Things (IoT) has brought changes in many fields by joining physical space with the cyber space. The IoT devices are becoming increasingly complex. With the rapid deployment of cameras, tasks in IoT like visual information are more important, but IoT devices have limited computing resources, including power, computing ability, storage, etc. Some tasks that might be perfectly normal to perform on a computer would be rather challenging on an IoT device. Therefore, how to maintain acceptable performance while minimizing resources is becoming a more consequential part in IoT. In the paper, we aim to solve the problem of object detection and tracking in IoT while minimizing resources. The traditional algorithms need to use convolutional neural network (CNN) to identify different objects in each frame, and then determine the tracking target from identified objects, which typically requires a lot of computing resources. By incorporating spatial attention, and multi-domain network, we proposed a novel algorithm named as Spatial Attention Powered Multi-Domain Network (SA-MDNet). By adding spatial attention mechanism to the original MDNet model, and using multi class cross entropy loss, we are able to distinguish the background and the target in different video sequences effectively and efficiently. This novel algorithm achieves similar performance on the OTB 50/100/2013 datasets compared to several state-of-art models, while uses only a fraction of the memory compared to MDNet.
               
Click one of the above tabs to view related content.