Articles with "video language" as a keyword



Surveillance Video-and-Language Understanding: From Small to Large Multimodal Models

Sign Up to like & get
recommendations!
Published in 2025 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2024.3462433

Abstract: Surveillance videos play a crucial role in public security. However, current tasks related to surveillance videos primarily focus on classifying and localizing anomalous events. Despite achieving notable performance, existing methods are restricted to detecting and… read more here.

Keywords: dataset; surveillance; video language; language ... See more keywords

Video DataFlywheel: Resolving the Impossible Data Trinity in Video-Language Understanding

Sign Up to like & get
recommendations!
Published in 2024 at "IEEE Transactions on Pattern Analysis and Machine Intelligence"

DOI: 10.1109/tpami.2025.3528394

Abstract: Recently, video-language understanding has achieved great success through large-scale pre-training. However, data scarcity remains a prevailing challenge. This study quantitatively reveals an “impossible trinity” among data quantity, diversity, and quality in pre-training datasets. Recent efforts… read more here.

Keywords: video; video dataflywheel; video language; language understanding ... See more keywords