Published in 2022 at "IEEE Wireless Communications"
DOI: 10.1109/mwc.008.2200180
Abstract: It is well known that multi-modal services, including video, audio, and haptic signals, aim to provide immersive experience with low latency and high reliability. Although multi-modal signals have differences in structure, transmission delay, and jitter,…
Keywords: modal semantic; multi modal; cross modal; semantic communications ...

Published in 2023 at "IEEE Transactions on Circuits and Systems for Video Technology"
DOI: 10.1109/tcsvt.2022.3220297
Abstract: Cross-modal image-text retrieval is an important area of Vision-and-Language task that models the similarity of image-text pairs by embedding features into a shared space for alignment. To bridge the heterogeneous gap between the two modalities,…
Keywords: cross modal; image text; importance; modal ...

Published in 2022 at "IEEE Transactions on Multimedia"
DOI: 10.1109/tmm.2021.3060291
Abstract: Synthesizing photo-realistic images based on text descriptions is a challenging image generation problem. Although many recent approaches have significantly advanced the performance of text-to-image generation, to guarantee semantic matchings between the text description and synthesized…
Keywords: cross modal; modal semantic; semantic matching; text image ...