Articles with "video text" as a keyword




Temporal Multimodal Graph Transformer With Global-Local Alignment for Video-Text Retrieval

Published in 2023 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3207910

Abstract: Video-text retrieval is a crucial task that has been a powerful application for multi-media data analysis and attracted tremendous interest in the research area. The core steps are feature representations and alignment to overcome the…

Keywords: video text; text retrieval; video; local alignment ...

Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals

Published in 2020 at "IEEE Transactions on Neural Networks and Learning Systems"

DOI: 10.1109/tnnls.2020.2997020

Abstract: Hashing has been widely applied to multimodal retrieval on large-scale multimedia data due to its efficiency in computation and storage. In this article, we propose a novel deep semantic multimodal hashing network (DSMHN) for scalable…

Keywords: retrieval; text; image text; video text ...