Articles with "modal semantic" as a keyword




Cross-Modal Semantic Communications

Published in 2022 at "IEEE Wireless Communications"

DOI: 10.1109/mwc.008.2200180

Abstract: It is well known that multi-modal services, including video, audio, and haptic signals, aim to provide immersive experience with low latency and high reliability. Although multi-modal signals have differences in structure, transmission delay, and jitter,…

Keywords: modal semantic; multi modal; cross modal; semantic communications ...

Image-Text Retrieval With Cross-Modal Semantic Importance Consistency

Published in 2023 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3220297

Abstract: Cross-modal image-text retrieval is an important area of Vision-and-Language task that models the similarity of image-text pairs by embedding features into a shared space for alignment. To bridge the heterogeneous gap between the two modalities,…

Keywords: cross modal; image text; importance; modal ...

Cross-Modal Semantic Matching Generative Adversarial Networks for Text-to-Image Synthesis

Published in 2022 at "IEEE Transactions on Multimedia"

DOI: 10.1109/tmm.2021.3060291

Abstract: Synthesizing photo-realistic images based on text descriptions is a challenging image generation problem. Although many recent approaches have significantly advanced the performance of text-to-image generation, to guarantee semantic matchings between the text description and synthesized…

Keywords: cross modal; modal semantic; semantic matching; text image ...