LAUSR: visual textual

The visual, the textual, and the one‐dimensional: An exploration of the visual elements of bibliographic classification schemes

Sign Up to like & get
recommendations!
0 Published in 2025 at "Journal of the Association for Information Science and Technology"

DOI: 10.1002/asi.70001

Abstract: Classification schemes are a key way of organizing bibliographic knowledge, yet the way that classification schemes communicate their information to classifiers receives little attention. This article takes a novel approach by exploring the visual aspects… read more here.

Keywords: classification; classification scheme; visual elements; classification schemes ... See more keywords

Ensemble learning on visual and textual data for social image emotion classification

Sign Up to like & get
recommendations!
0 Published in 2019 at "International Journal of Machine Learning and Cybernetics"

DOI: 10.1007/s13042-017-0734-0

Abstract: Texts, images and other information are posted everyday on the social network and provides a large amount of multimodal data. The aim of this work is to investigate if combining and integrating both visual and… read more here.

Keywords: image emotion; visual textual; textual data; image ... See more keywords

GuideCAD: A Lightweight Multimodal Framework for 3D CAD Model Generation via Prefix Embedding

Sign Up to like & get
recommendations!
0 Published in 2025 at "IEEE Access"

DOI: 10.1109/access.2025.3604810

Abstract: Multi-modal approaches used for 3D CAD generation require substantial computational resources, necessitating efficient training. To address this, we propose GuideCAD, which leverages semantically rich visual-textual representations having only a small number of trainable parameters to… read more here.

Keywords: cad model; generation; guidecad lightweight; model ... See more keywords

A Simple Visual-Textual Baseline for Pedestrian Attribute Recognition

Sign Up to like & get
recommendations!
1 Published in 2022 at "IEEE Transactions on Circuits and Systems for Video Technology"

DOI: 10.1109/tcsvt.2022.3178144

Abstract: Pedestrian attribute recognition (PAR), which aims to identify attributes of the pedestrians captured in video surveillance, is a challenging task due to the poor quality of images and diverse spatial distribution among attributes. Existing methods… read more here.

Keywords: attribute; baseline; attribute recognition; pedestrian attribute ... See more keywords

Synergistic Prompting Learning for Human-Object Interaction Detection

Sign Up to like & get
recommendations!
0 Published in 2025 at "IEEE Transactions on Image Processing"

DOI: 10.1109/tip.2025.3607614

Abstract: Human-Object Interaction (HOI) detection, as a foundational task in human-centric understanding, aims to detect interactive triplets in real-world scenarios. To better distinguish diverse HOIs within an open-world context, current HOI detectors utilize pre-trained Visual-Language Models… read more here.

Keywords: detection; object interaction; human object; interaction ... See more keywords

Improving chart question answering through integration of figure captions and visual–textual co-attention

Sign Up to like & get
recommendations!
0 Published in 2025 at "Journal of Electronic Imaging"

DOI: 10.1117/1.jei.34.2.023012

Abstract: Abstract. The proposed work for chart question answering (CQA) addresses crucial aspects particularly those associated with questions requiring complex reasoning and visual references to charts and those related to optical character recognition (OCR) noise. Existing… read more here.

Keywords: textual attention; model; chart question; question ... See more keywords

LAUSR