Articles with "video modalities" as a keyword



UniSOT: A Unified Framework for Multi-Modality Single Object Tracking

Sign Up to like & get
recommendations!
Published in 2025 at "IEEE transactions on pattern analysis and machine intelligence"

DOI: 10.1109/tpami.2025.3615714

Abstract: Single object tracking aims to localize target object with specific reference modalities (bounding box, natural language or both) in a sequence of specific video modalities (RGB, RGB+Depth, RGB+Thermal or RGB+Event.). Different reference modalities enable various… read more here.

Keywords: reference modalities; language; video modalities; single object ... See more keywords