Articles with "position learning" as a keyword



Photo by hajjidirir from unsplash

Double-Stream Position Learning Transformer Network for Image Captioning

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3181490

Abstract: Image captioning has made significant achievement through developing feature extractor and model architecture. Recently, the image region features extracted by object detector prevail in most existing models. However, region features are criticized for the lacking… read more here.

Keywords: position learning; stream; region; double stream ... See more keywords