Articles with "video understanding" as a keyword




A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

Published in 2020 at "International Journal of Multimedia Information Retrieval"

DOI: 10.1007/s13735-019-00190-x

Abstract: Video understanding requires abundant semantic information. Substantial progress has been made on deep learning models in the image, text, and audio domains, and notable efforts have been recently dedicated to the design of deep networks…

Keywords: models feature; deep learning; learning spatiotemporal; spatiotemporal models ...

Gated PE-NL-MA: A multi-modal attention based network for video understanding

Published in 2021 at "Neurocomputing"

DOI: 10.1016/j.neucom.2020.05.112

Abstract: In multi-modal learning tasks such as video understanding, the most important operations are feature extraction and feature enhancement for single modality and feature aggregation between modalities. In this paper, we present two attention based…

Keywords: network; attention; multi modal; attention based ...