Sign Up to like & get
recommendations!
0
Published in 2024 at "IEEE Transactions on Circuits and Systems for Video Technology"
DOI: 10.1109/tcsvt.2024.3435561
Abstract: Understanding human intentions (e.g., emotions) from videos has received considerable attention recently. Video streams generally constitute a blend of temporal data stemming from distinct modalities, including natural language, facial expressions, and auditory clues. Despite the…
read more here.
Keywords:
modality exclusive;
learning modality;
sequence fusion;
modality ... See more keywords