LAUSR: temporal semantic

Cross-Attentional Spatio-Temporal Semantic Graph Networks for Video Question Answering

Published in 2022 at "IEEE Transactions on Image Processing"

DOI: 10.1109/tip.2022.3142526

Abstract: Due to the rich spatio-temporal visual content and complex multimodal relations, Video Question Answering (VideoQA) has become a challenging task and attracted increasing attention. Current methods usually leverage visual attention, linguistic attention, or self-attention to…

Keywords: video question; temporal semantic; spatio temporal; attention ...

Spiking Tucker Fusion Transformer for Audio-Visual Zero-Shot Learning

Published in 2024 at "IEEE Transactions on Image Processing"

DOI: 10.1109/tip.2024.3430080

Abstract: The spiking neural networks (SNNs) that efficiently encode temporal sequences have shown great potential in extracting audio-visual joint feature representations. However, coupling SNNs (binary spike sequences) with transformers (float-point sequences) to jointly explore the temporal-semantic…

Keywords: temporal semantic; audio visual; fusion; tucker fusion ...

ACO-TSSCD: An Optimized Deep Multimodal Temporal Semantic Segmentation Change Detection Approach for Monitoring Agricultural Land Conversion

Published in 2024 at "Agronomy"

DOI: 10.3390/agronomy14122909

Abstract: With the acceleration of urbanization in agricultural areas and the continuous changes in land-use patterns, the transformation of agricultural land presents complexity and dynamism, which puts higher demands on precise monitoring. And most existing monitoring…

Keywords: agricultural land; temporal semantic; land; semantic segmentation ...
