Published in 2024 at "Multimedia Tools and Applications"
DOI: 10.1007/s11042-023-17977-0
Abstract: With the increasing importance of multimedia and multilingual data in online encyclopedias, novel methods are needed to fill domain gaps and automatically connect different modalities for increased accessibility. For example, Wikipedia is composed of millions…
Keywords: cascaded transformer; caption; image; caption matching ...
Published in 2018 at "Neurocomputing"
DOI: 10.1016/j.neucom.2018.05.080
Abstract: Image captioning means automatically generating a caption for an image. As a recently emerged research area, it is attracting more and more attention. To achieve the goal of image captioning, semantic information of images…
Keywords: image; research; image captioning; survey ...
Published in 2019 at "Neurocomputing"
DOI: 10.1016/j.neucom.2018.12.026
Abstract: Automatic generation of caption to describe the content of an image has been gaining a lot of research interests recently, where most of the existing works treat the image caption as pure sequential data.…
Keywords: based image; image caption; image; phrase based ...
Published in 2018 at "Natural Language Engineering"
DOI: 10.1017/s1351324918000098
Abstract: When a recurrent neural network (RNN) language model is used for caption generation, the image information can be fed to the neural network either by directly incorporating it in the RNN – conditioning the…
Keywords: image; rnn; caption; language model ...
Published in 2024 at "IEEE Access"
DOI: 10.1109/access.2024.3519094
Abstract: Recently, steganalysis of Voice over Internet Protocol (VoIP) compressed speech has gained attention. In real voice communication, Joint Parallel Steganography (JPS) often occurs, where multiple steganography algorithms coexist. The multifaceted nature of JPS, incorporating various…
Keywords: caption; steganalysis model; steganalysis; model based ...
Published in 2025 at "IEEE Access"
DOI: 10.1109/access.2025.3632152
Abstract: A major limitation is the scarcity of geospatial datasets that simultaneously provide multispectral imagery and descriptive captions. In particular, datasets containing aligned RGB, multispectral, and caption information remain highly limited. Therefore, we propose a full-circle…
Keywords: caption; diffusion; image; rgb images ...
Published in 2021 at "IEEE/CAA Journal of Automatica Sinica"
DOI: 10.1109/jas.2020.1003402
Abstract: In this paper, we develop a novel global-attention-based neural network (GANN) for vision language intelligence, specifically, image captioning (language description of a given image). As many previous works, the encoder-decoder framework is adopted in our…
Keywords: global attention; attention; feature; caption ...
Published in 2022 at "IEEE Geoscience and Remote Sensing Letters"
DOI: 10.1109/lgrs.2022.3192062
Abstract: Image captioning in remote sensing can help us understand the inner attributes of the objects and the outer relationships between different objects. However, the existing image captioning algorithms lack the ability of global representation and…
Keywords: caption; remote sensing; type; image captioning ...
Published in 2022 at "IEEE Transactions on Pattern Analysis and Machine Intelligence"
DOI: 10.1109/tpami.2022.3187350
Abstract: We propose the first mechanism to train object detection models from weak supervision in the form of captions at the image level. Language-based supervision for detection is appealing and inexpensive: many blogs with images and…
Keywords: caption; supervision; level; detection ...
Published in 2024 at "Frontiers in Psychology"
DOI: 10.3389/fpsyg.2023.1314076
Abstract: Panoramic video and virtual reality technologies create learning environments that provide learners with an “immersive” experience. In recent years, panoramic video design to create immersive learning environments, in particular, has become an increasingly popular topic…
Keywords: caption; virtual learning; panoramic virtual; learning environment ...