Cited By
View all- Wen ZNiu SLi GWu QTan MWu Q(2024)Test-Time Model Adaptation for Visual Question Answering With Debiased Self-SupervisionsIEEE Transactions on Multimedia10.1109/TMM.2023.329259726(2137-2147)Online publication date: 2024
Visual Relational Reasoning is crucial for many vision-and-language based tasks, such as Visual Question Answering and Vision Language Navigation. In this paper, we consider reasoning on complex referring expression comprehension (c-REF) task that ...
Visual relationship modeling plays an indispensable role in visual question answering (VQA). VQA models need to fully understand the visual scene and positional relationships within the image to answer complex reasoning questions involving visual ...
Association for Computing Machinery
New York, NY, United States
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in