Articles with "image text" as a keyword



Photo from wikipedia

Image-text dual neural network with decision strategy for small-sample image classification

Sign Up to like & get
recommendations!
Published in 2019 at "Neurocomputing"

DOI: 10.1016/j.neucom.2018.02.099

Abstract: Abstract Small-sample classification is a challenging problem in computer vision. In this work, we show how to efficiently and effectively utilize semantic information of the annotations to improve the performance of small-sample classification. First, we… read more here.

Keywords: dual neural; image; text dual; classification ... See more keywords
Photo by ewxy from unsplash

FB-Net: Dual-Branch Foreground-Background Fusion Network With Multi-Scale Semantic Scanning for Image-Text Retrieval

Sign Up to like & get
recommendations!
Published in 2023 at "IEEE Access"

DOI: 10.1109/access.2023.3263512

Abstract: As a fundamental branch in cross-modal retrieval, image-text retrieval is still a challenging problem largely due to the complementary and imbalanced relationship between different modalities. However, existing works have not effectively scanned and aligned the… read more here.

Keywords: text retrieval; foreground background; image; image text ... See more keywords
Photo from wikipedia

Hierarchical Knowledge-Based Graph Embedding Model for Image–Text Matching in IoTs

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Internet of Things Journal"

DOI: 10.1109/jiot.2021.3098897

Abstract: The development of Internet of Things systems (IoTs) and 5G technology has allowed image and text information to be collected and spread at an unprecedentedly high speed. To improve the data processing capabilities of IoTs,… read more here.

Keywords: text matching; knowledge; graph; image text ... See more keywords
Photo from wikipedia

Learning and Integrating Multi-Level Matching Features for Image-Text Retrieval

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2021.3135825

Abstract: In recent years, several retrieval methods for measuring the similarity between images and texts have been proposed. Despite the efficiency of most of these methods, the scalar-based cosine similarities may not be sufficiently expressive to… read more here.

Keywords: level matching; multi level; matching features; image text ... See more keywords
Photo by usgs from unsplash

Regularizing Visual Semantic Embedding With Contrastive Learning for Image-Text Matching

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Signal Processing Letters"

DOI: 10.1109/lsp.2022.3178899

Abstract: Learning visual semantic embedding for image-text matching has achieved high success by using triplet loss to pull positive image-text pairs which share similar semantic meaning and to push negative image-text pairs which share different semantic… read more here.

Keywords: image text; semantic embedding; visual semantic; text pairs ... See more keywords
Photo from wikipedia

Dual-Level Representation Enhancement on Characteristic and Context for Image-Text Retrieval

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3182426

Abstract: Image-text retrieval is a fundamental and vital task in multi-media retrieval and has received growing attention since it connects heterogeneous data. Previous methods that perform well on image-text retrieval mainly focus on the interaction between… read more here.

Keywords: text retrieval; level representation; image; image text ... See more keywords
Photo from wikipedia

Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3186714

Abstract: Supervised cross-modal image-text hashing has aroused extensive concentrations in comprehending the correspondence between vision and language for data search tasks. Existing methods learn the compact hash codes by leveraging a given image-text data pairs or… read more here.

Keywords: joint semantic; semantic alignment; cross modal; image text ... See more keywords
Photo from wikipedia

Image-Text Retrieval With Cross-Modal Semantic Importance Consistency

Sign Up to like & get
recommendations!
Published in 2023 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3220297

Abstract: Cross-modal image-text retrieval is an important area of Vision-and-Language task that models the similarity of image-text pairs by embedding features into a shared space for alignment. To bridge the heterogeneous gap between the two modalities,… read more here.

Keywords: cross modal; image text; importance; modal ... See more keywords
Photo from wikipedia

SMAN: Stacked Multimodal Attention Network for Cross-Modal Image-Text Retrieval.

Sign Up to like & get
recommendations!
Published in 2020 at "IEEE transactions on cybernetics"

DOI: 10.1109/tcyb.2020.2985716

Abstract: This article focuses on tackling the task of the cross-modal image-text retrieval which has been an interdisciplinary topic in both computer vision and natural language processing communities. Existing global representation alignment-based methods fail to pinpoint… read more here.

Keywords: attention; cross modal; image text;
Photo by kellysikkema from unsplash

Learning Relationship-Enhanced Semantic Graph for Fine-Grained Image-Text Matching.

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE transactions on cybernetics"

DOI: 10.1109/tcyb.2022.3179020

Abstract: Image-text matching of natural scenes has been a popular research topic in both computer vision and natural language processing communities. Recently, fine-grained image-text matching has shown its significant advance in inferring the high-level semantic correspondence… read more here.

Keywords: image text; text matching; graph; fine grained ... See more keywords
Photo by kellysikkema from unsplash

Adaptive Latent Graph Representation Learning for Image-Text Matching

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Image Processing"

DOI: 10.1109/tip.2022.3229631

Abstract: Image-text matching is a challenging task due to the modality gap. Many recent methods focus on modeling entity relationships to learn a common embedding space of image and text. However, these methods suffer from distractions… read more here.

Keywords: image text; text matching; graph; latent graph ... See more keywords