Articles with "sequence fusion" as a keyword



Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations

Published in 2024 in "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2024.3435561

Abstract: Understanding human intentions (e.g., emotions) from videos has received considerable attention recently. Video streams generally constitute a blend of temporal data stemming from distinct modalities, including natural language, facial expressions, and auditory clues. Despite the…

Keywords: modality exclusive; learning modality; sequence fusion; modality ...