Articles with "video grounding" as a keyword



Photo by rouichi from unsplash

Intra- and Inter-modal Multilinear Pooling with Multitask Learning for Video Grounding

Sign Up to like & get
recommendations!
Published in 2020 at "Neural Processing Letters"

DOI: 10.1007/s11063-020-10205-y

Abstract: Video grounding aims to temporally localize an action in an untrimmed video referred to by a query in natural language, which plays an important role in fine-grained video understanding. Given temporal proposals of limited granularity,… read more here.

Keywords: inter modal; video; intra inter; video grounding ... See more keywords
Photo by jilburr from unsplash

Efficient Video Grounding With Which-Where Reading Comprehension

Sign Up to like & get
recommendations!
Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3174136

Abstract: Video grounding aims at localizing the temporal moment related to the given language description, which is very helpful to many cross-modal content understanding applications like visual question answering and sentence-video search. Existing approaches usually directly… read more here.

Keywords: efficient video; reading comprehension; video grounding; decision space ... See more keywords