Articles with "universal multimodal" as a keyword



Photo from wikipedia

Universal Multimodal Representation for Language Understanding

Sign Up to like & get
recommendations!
Published in 2023 at "IEEE transactions on pattern analysis and machine intelligence"

DOI: 10.1109/tpami.2023.3234170

Abstract: Representation learning is the foundation of natural language processing (NLP). This work presents new methods to employ visual information as assistant signals to general NLP tasks. For each sentence, we first retrieve a flexible number… read more here.

Keywords: language; image pairs; representation; universal multimodal ... See more keywords