LAUSR: video understanding

A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

Published in 2020 in "International Journal of Multimedia Information Retrieval"

DOI: 10.1007/s13735-019-00190-x

Abstract: Video understanding requires abundant semantic information. Substantial progress has been made on deep learning models in the image, text, and audio domains, and notable efforts have been recently dedicated to the design of deep networks…

Keywords: models feature; deep learning; learning spatiotemporal; spatiotemporal models ...

Gated PE-NL-MA: A multi-modal attention based network for video understanding

Published in 2021 in "Neurocomputing"

DOI: 10.1016/j.neucom.2020.05.112

Abstract: In multi-modal learning tasks such as video understanding, the most important operations are feature extraction and feature enhancement for single modality and feature aggregation between modalities. In this paper, we present two attention based…

Keywords: network; attention; multi modal; attention based ...
