Joint Discovery of Object States and Manipulation Actions

Alayrac, Jean-Baptiste; Sivic, Josev; Laptev, Ivan; Lacoste-Julien, Simon

Computer Science > Computer Vision and Pattern Recognition

arXiv:1702.02738 (cs)

[Submitted on 9 Feb 2017 (v1), last revised 28 Aug 2017 (this version, v3)]

Title:Joint Discovery of Object States and Manipulation Actions

Authors:Jean-Baptiste Alayrac, Josev Sivic, Ivan Laptev, Simon Lacoste-Julien

View PDF

Abstract:Many human activities involve object manipulations aiming to modify the object state. Examples of common state changes include full/empty bottle, open/closed door, and attached/detached car wheel. In this work, we seek to automatically discover the states of objects and the associated manipulation actions. Given a set of videos for a particular task, we propose a joint model that learns to identify object states and to localize state-modifying actions. Our model is formulated as a discriminative clustering cost with constraints. We assume a consistent temporal order for the changes in object states and manipulation actions, and introduce new optimization techniques to learn model parameters without additional supervision. We demonstrate successful discovery of seven manipulation actions and corresponding object states on a new dataset of videos depicting real-life object manipulations. We show that our joint formulation results in an improvement of object state discovery by action recognition and vice versa.

Comments:	Appears in: International Conference on Computer Vision 2017 (ICCV 2017). 15 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
ACM classes:	I.5.1; I.5.4; I.2
Cite as:	arXiv:1702.02738 [cs.CV]
	(or arXiv:1702.02738v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1702.02738

Submission history

From: Jean-Baptiste Alayrac [view email]
[v1] Thu, 9 Feb 2017 08:04:33 UTC (2,090 KB)
[v2] Mon, 10 Apr 2017 08:23:00 UTC (1,965 KB)
[v3] Mon, 28 Aug 2017 08:04:18 UTC (3,190 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Joint Discovery of Object States and Manipulation Actions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Joint Discovery of Object States and Manipulation Actions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators