Articles with "caption" as a keyword



Cascaded transformer-based networks for wikipedia large-scale image-caption matching

Sign Up to like & get
recommendations!
Published in 2024 at "Multimedia Tools and Applications"

DOI: 10.1007/s11042-023-17977-0

Abstract: With the increasing importance of multimedia and multilingual data in online encyclopedias, novel methods are needed to fill domain gaps and automatically connect different modalities for increased accessibility. For example, Wikipedia is composed of millions… read more here.

Keywords: cascaded transformer; caption; image; caption matching ... See more keywords

A survey on automatic image caption generation

Sign Up to like & get
recommendations!
Published in 2018 at "Neurocomputing"

DOI: 10.1016/j.neucom.2018.05.080

Abstract: Abstract Image captioning means automatically generating a caption for an image. As a recently emerged research area, it is attracting more and more attention. To achieve the goal of image captioning, semantic information of images… read more here.

Keywords: image; research; image captioning; survey ... See more keywords

Phrase-based image caption generator with hierarchical LSTM network

Sign Up to like & get
recommendations!
Published in 2019 at "Neurocomputing"

DOI: 10.1016/j.neucom.2018.12.026

Abstract: Abstract Automatic generation of caption to describe the content of an image has been gaining a lot of research interests recently, where most of the existing works treat the image caption as pure sequential data.… read more here.

Keywords: based image; image caption; image; phrase based ... See more keywords
Photo from wikipedia

Where to put the image in an image caption generator

Sign Up to like & get
recommendations!
Published in 2018 at "Natural Language Engineering"

DOI: 10.1017/s1351324918000098

Abstract: Abstract When a recurrent neural network (RNN) language model is used for caption generation, the image information can be fed to the neural network either by directly incorporating it in the RNN – conditioning the… read more here.

Keywords: image; rnn; caption; language model ... See more keywords

InSeC: Steganalysis Model Based on Inter-Codeword Sensitivity Caption for Compressed Speech Streams

Sign Up to like & get
recommendations!
Published in 2024 at "IEEE Access"

DOI: 10.1109/access.2024.3519094

Abstract: Recently, steganalysis of Voice over Internet Protocol (VoIP) compressed speech has gained attention. In real voice communication, Joint Parallel Steganography (JPS) often occurs, where multiple steganography algorithms coexist. The multifaceted nature of JPS, incorporating various… read more here.

Keywords: caption; steganalysis model; steganalysis; model based ... See more keywords

Multispectral Image Caption Unification Using Diffusion and Cycle GAN Models

Sign Up to like & get
recommendations!
Published in 2025 at "IEEE Access"

DOI: 10.1109/access.2025.3632152

Abstract: A major limitation is the scarcity of geospatial datasets that simultaneously provide multispectral imagery and descriptive captions. In particular, datasets containing aligned RGB, multispectral, and caption information remain highly limited. Therefore, we propose a full-circle… read more here.

Keywords: caption; diffusion; image; rgb images ... See more keywords
Photo from wikipedia

Global-Attention-Based Neural Networks for Vision Language Intelligence

Sign Up to like & get
recommendations!
Published in 2021 at "IEEE/CAA Journal of Automatica Sinica"

DOI: 10.1109/jas.2020.1003402

Abstract: In this paper, we develop a novel global-attention-based neural network (GANN) for vision language intelligence, specifically, image captioning (language description of a given image). As many previous works, the encoder-decoder framework is adopted in our… read more here.

Keywords: global attention; attention; feature; caption ... See more keywords

TypeFormer: Multiscale Transformer With Type Controller for Remote Sensing Image Caption

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Geoscience and Remote Sensing Letters"

DOI: 10.1109/lgrs.2022.3192062

Abstract: Image captioning in remote sensing can help us understand the inner attributes of the objects and the outer relationships between different objects. However, the existing image captioning algorithms lack the ability of global representation and… read more here.

Keywords: caption; remote sensing; type; image captioning ... See more keywords

Learning to Overcome Noise in Weak Caption Supervision for Object Detection

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Pattern Analysis and Machine Intelligence"

DOI: 10.1109/tpami.2022.3187350

Abstract: We propose the first mechanism to train object detection models from weak supervision in the form of captions at the image level. Language-based supervision for detection is appealing and inexpensive: many blogs with images and… read more here.

Keywords: caption; supervision; level; detection ... See more keywords

Research on the design of panoramic virtual learning environment screen elements

Sign Up to like & get
recommendations!
Published in 2024 at "Frontiers in Psychology"

DOI: 10.3389/fpsyg.2023.1314076

Abstract: Panoramic video and virtual reality technologies create learning environments that provide learners with an “immersive” experience. In recent years, panoramic video design to create immersive learning environments, in particular, has become an increasingly popular topic… read more here.

Keywords: caption; virtual learning; panoramic virtual; learning environment ... See more keywords