LAUSR: position learning

Photo by hajjidirir from unsplash

Double-Stream Position Learning Transformer Network for Image Captioning

Sign Up to like & get
recommendations!
1 Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3181490

Abstract: Image captioning has made significant achievement through developing feature extractor and model architecture. Recently, the image region features extracted by object detector prevail in most existing models. However, region features are criticized for the lacking… read more here.

Keywords: position learning; stream; region; double stream ... See more keywords

LAUSR

You are not signed in:

Sign Up!

Double-Stream Position Learning Transformer Network for Image Captioning