Articles with "large multimodal" as a keyword



MangaUB: A Manga Understanding Benchmark for Large Multimodal Models

Sign Up to like & get
recommendations!
Published in 2024 at "IEEE MultiMedia"

DOI: 10.1109/mmul.2025.3550451

Abstract: Manga is a popular medium that combines stylized drawings and text to convey stories. As manga panels differ from natural images, computational systems traditionally had to be designed specifically for manga. Recently, the adaptive nature… read more here.

Keywords: manga understanding; multimodal models; large multimodal; understanding benchmark ... See more keywords