Articles with "attention pyramid" as a keyword



Photo from wikipedia

AP-CNN: Weakly Supervised Attention Pyramid Convolutional Neural Network for Fine-Grained Visual Classification

Sign Up to like & get
recommendations!
Published in 2021 at "IEEE Transactions on Image Processing"

DOI: 10.1109/tip.2021.3055617

Abstract: Classifying the sub-categories of an object from the same super-category (e.g., bird species and cars) in fine-grained visual classification (FGVC) highly relies on discriminative feature representation and accurate region localization. Existing approaches mainly focus on… read more here.

Keywords: attention; grained visual; cnn; fine grained ... See more keywords
Photo by paipai90 from unsplash

Person Re-Identification via Attention Pyramid

Sign Up to like & get
recommendations!
Published in 2021 at "IEEE Transactions on Image Processing"

DOI: 10.1109/tip.2021.3107211

Abstract: In this paper, we propose an attention pyramid method for person re-identification. Unlike conventional attention-based methods which only learn a global attention map, our attention pyramid exploits the attention regions in a multi-scale manner because… read more here.

Keywords: attention; person identification; attention pyramid;
Photo from wikipedia

Dual Attention on Pyramid Feature Maps for Image Captioning

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Multimedia"

DOI: 10.1109/tmm.2021.3072479

Abstract: Generating natural sentences from images is a fundamental learning task for visual-semantic understanding in multimedia. In this paper, we propose to apply dual attention on pyramid image feature maps to fully explore the visual-semantic correlations… read more here.

Keywords: attention; feature; image; attention pyramid ... See more keywords
Photo from wikipedia

Transformer-Based Model with Dynamic Attention Pyramid Head for Semantic Segmentation of VHR Remote Sensing Imagery

Sign Up to like & get
recommendations!
Published in 2022 at "Entropy"

DOI: 10.3390/e24111619

Abstract: Convolutional neural networks have long dominated semantic segmentation of very-high-resolution (VHR) remote sensing (RS) images. However, restricted by the fixed receptive field of convolution operation, convolution-based models cannot directly obtain contextual information. Meanwhile, Swin Transformer… read more here.

Keywords: pyramid head; attention; attention pyramid; swin transformer ... See more keywords