Affordance Transfer Learning for Human-Object Interaction Detection

Hou, Zhi; Yu, Baosheng; Qiao, Yu; Peng, Xiaojiang; Tao, Dacheng


arXiv:2104.02867 (cs)

[Submitted on 7 Apr 2021 (v1), last revised 9 Jun 2021 (this version, v2)]

Abstract: Reasoning about human-object interactions (HOIs) is essential for deeper scene understanding, and object affordances (or functionalities) are of great importance for humans to discover unseen HOIs with novel objects. Inspired by this, we introduce an affordance transfer learning approach to jointly detect HOIs with novel objects and recognize affordances. Specifically, HOI representations can be decoupled into a combination of affordance and object representations, making it possible to compose novel interactions by combining affordance representations with novel object representations from additional images, i.e., transferring the affordance to novel objects. With the proposed affordance transfer learning, the model is also capable of inferring the affordances of novel objects from known affordance representations. The proposed method can thus be used to 1) improve the performance of HOI detection, especially for HOIs with unseen objects; and 2) infer the affordances of novel objects. Experimental results on two datasets, HICO-DET and HOI-COCO (from V-COCO), demonstrate significant improvements over recent state-of-the-art methods for HOI detection and object affordance detection. Code is available at this https URL
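The composition step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the use of plain concatenation as the "combination", the feature sizes, and the toy verb and object classes are all assumptions made for illustration. It pairs every affordance (verb) feature with every object feature, including objects taken from additional images, and assigns an HOI label only to pairs that form a known interaction class.

```python
import numpy as np


def compose_hoi_features(verb_feats, obj_feats):
    """Pair every verb (affordance) feature with every object feature.

    verb_feats: (V, Dv) array; obj_feats: (O, Do) array.
    Returns the (V * O, Dv + Do) composite features and, for each row,
    the index of the verb and of the object it was built from.
    Concatenation is an assumed stand-in for the paper's combination.
    """
    v_idx, o_idx = np.meshgrid(
        np.arange(len(verb_feats)), np.arange(len(obj_feats)), indexing="ij"
    )
    v_idx, o_idx = v_idx.ravel(), o_idx.ravel()
    composed = np.concatenate([verb_feats[v_idx], obj_feats[o_idx]], axis=1)
    return composed, v_idx, o_idx


def compose_labels(verb_classes, obj_classes, v_idx, o_idx, hoi_index):
    """Build one-hot HOI labels for the composite features.

    hoi_index maps (verb, object) pairs to an HOI class id (hypothetical).
    Pairs that are not a known HOI class keep an all-zero label row.
    """
    labels = np.zeros((len(v_idx), len(hoi_index)), dtype=np.float32)
    for row, (v, o) in enumerate(zip(v_idx, o_idx)):
        key = (verb_classes[v], obj_classes[o])
        if key in hoi_index:
            labels[row, hoi_index[key]] = 1.0
    return labels


# Toy data: two verb features from HOI images, three object features,
# where "horse" plays the role of an object from an additional image.
rng = np.random.default_rng(0)
verb_feats = rng.normal(size=(2, 4))
obj_feats = rng.normal(size=(3, 5))
verb_classes = ["ride", "eat"]
obj_classes = ["bicycle", "apple", "horse"]
hoi_index = {("ride", "bicycle"): 0, ("ride", "horse"): 1, ("eat", "apple"): 2}

composed, v_idx, o_idx = compose_hoi_features(verb_feats, obj_feats)
labels = compose_labels(verb_classes, obj_classes, v_idx, o_idx, hoi_index)
print(composed.shape, labels.sum(axis=1))
```

In this sketch the "ride" feature is reused with the "horse" object feature, which mirrors the idea of transferring an affordance to an object that never appeared with that verb in the HOI training images. Implausible pairs such as "eat" with "bicycle" receive no label and could simply be dropped before training.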

Comments:	Accepted to CVPR 2021; adds a new but important ablation experiment in the appendix (union box verb representation)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2104.02867 [cs.CV]
	(or arXiv:2104.02867v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2104.02867

Submission history

From: Zhi Hou
[v1] Wed, 7 Apr 2021 02:37:04 UTC (4,187 KB)
[v2] Wed, 9 Jun 2021 06:02:11 UTC (4,205 KB)
