Published in 2019 at "IEEE Transactions on Pattern Analysis and Machine Intelligence"
DOI: 10.1109/tpami.2018.2890628
Abstract: Recent insights on language and vision with neural networks have been successfully applied to simple single-image visual question answering. However, to tackle real-life question answering problems on multimedia collections such as personal photo albums, we…
Keywords:
focal visual;
question;
grounding photos;
visual text ...