Sign Up to like & get
recommendations!
2
Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"
DOI: 10.1109/tcsvt.2021.3127149
Abstract: Convolutional neural networks (CNNs) are good at extracting contexture features within certain receptive fields, while transformers can model the global long-range dependency features. By absorbing the advantage of transformer and the merit of CNN, Swin…
read more here.
Keywords:
rgb rgb;
salient object;
rgb;
modality ... See more keywords