Published in 2023 at "IEEE Transactions on Circuits and Systems for Video Technology"
DOI: 10.1109/tcsvt.2022.3220297
Abstract: Cross-modal image-text retrieval is an important Vision-and-Language task that models the similarity of image-text pairs by embedding their features into a shared space for alignment. To bridge the heterogeneous gap between the two modalities,…
Keywords:
cross modal;
image text;
importance;
modal ...