Sign Up to like & get
recommendations!
0
Published in 2024 at "IEEE Transactions on Pattern Analysis and Machine Intelligence"
DOI: 10.1109/tpami.2025.3528394
Abstract: Recently, video-language understanding has achieved great success through large-scale pre-training. However, data scarcity remains a prevailing challenge. This study quantitatively reveals an “impossible trinity” among data quantity, diversity, and quality in pre-training datasets. Recent efforts…
read more here.
Keywords:
video;
video dataflywheel;
video language;
language understanding ... See more keywords