Over the last few years, researchers have focused on computer‐aided polyp detection in gastroscopy. Deep learning (DL) has shown great promise for polyps' identification. The most exceptional contribution of DL… Click to show full abstract
Over the last few years, researchers have focused on computer‐aided polyp detection in gastroscopy. Deep learning (DL) has shown great promise for polyps' identification. The most exceptional contribution of DL methods in gastroenterology is their ability to identify polyps quickly and accurately using convolution neural network. Nonetheless, despite significant advancements, automatic detection of small polyps remains a challenging and complex task. Furthermore, due to multiple pooling operations, the features of small polyps are lost, resulting in low detection accuracy. This paper proposes an efficient object detection method for polyp detection using gastric images to address this issue. A single‐shot multi‐box detector (SSD) was combined with the feature extractor VGG‐16, and the Refined Map Block (RMB) was integrated into SSD's high‐resolution feature maps to get more semantic information. The RMB output was used as the input to the successive layers. The RMB comprises of attention cascade and feature map concatenation cascade. The attention cascade improved the localization accuracy, while the feature map concatenation cascade improved the classification accuracy. Using the former, the proposed attention‐based SSD for gastric polyps (ASSD‐GPNet) model focused on the specific information, a polyp, rather than the background. Furthermore, the feature map concatenation cascade adds semantic information while reducing computational complexity. The output of these two cascades was combined to produce a refined feature map that enhances the detection of small polyps. The model was trained and tested on 1970 gastric images and Pascal VOC07 + 12. Image augmentation was applied to increase the training data of gastric images to reduce overfitting and skip connections were used to overcome the vanishing gradient problem. Overall, the experimental results demonstrated that the proposed model outperformed than compared models in both medical and natural images. The ASSD‐GPNet obtained mean average precision (mAP) of 94.2% on gastric images and 76.9% on Pascal VOC.
               
Click one of the above tabs to view related content.