O2O-Afford: Annotation-Free Large-Scale Object-Object Affordance Learning

Mo, Kaichun; Qin, Yuzhe; Xiang, Fanbo; Su, Hao; Guibas, Leonidas

arXiv:2106.15087v1 (cs)

[Submitted on 29 Jun 2021 (this version), latest version 25 Oct 2021 (v2)]

Abstract: In contrast to the vast literature on modeling, perceiving, and understanding agent-object (e.g., human-object, hand-object, robot-object) interaction in computer vision and robotics, very few past works have studied object-object interaction, which also plays an important role in robotic manipulation and planning tasks. Daily life offers a rich space of object-object interaction scenarios, such as placing an object on a messy tabletop, fitting an object inside a drawer, or pushing an object with a tool. In this paper, we propose a unified affordance learning framework to learn object-object interaction for various tasks. By constructing four object-object interaction task environments using physical simulation (SAPIEN) and thousands of ShapeNet models with rich geometric diversity, we are able to conduct large-scale object-object affordance learning without the need for human annotations or demonstrations. As our core technical contribution, we propose an object-kernel point convolution network to reason about detailed interaction between two objects. Experiments on large-scale synthetic data and real-world data demonstrate the effectiveness of the proposed approach. Please refer to the project webpage for code, data, video, and more materials: this https URL

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2106.15087 [cs.CV]
	(or arXiv:2106.15087v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2106.15087

Submission history

From: Kaichun Mo
[v1] Tue, 29 Jun 2021 04:38:12 UTC (4,052 KB)
[v2] Mon, 25 Oct 2021 21:26:44 UTC (3,878 KB)
