Sign Up to like & get
recommendations!
1
Published in 2022 at "IEEE transactions on pattern analysis and machine intelligence"
DOI: 10.1109/tpami.2022.3173208
Abstract: Recent methods for visual question answering rely on large-scale annotated datasets. Manual annotation of questions and answers for videos, however, is tedious, expensive and prevents scalability. In this work, we propose to avoid manual annotation…
read more here.
Keywords:
video question;
question;
learning answer;
videoqa ... See more keywords