Grounding Symbols in Multi-Modal Instructions

Hristov, Yordan; Penkov, Svetlin; Lascarides, Alex; Ramamoorthy, Subramanian

Computer Science > Artificial Intelligence

arXiv:1706.00355 (cs)

[Submitted on 1 Jun 2017]

Title:Grounding Symbols in Multi-Modal Instructions

Authors:Yordan Hristov, Svetlin Penkov, Alex Lascarides, Subramanian Ramamoorthy

View PDF

Abstract:As robots begin to cohabit with humans in semi-structured environments, the need arises to understand instructions involving rich variability---for instance, learning to ground symbols in the physical world. Realistically, this task must cope with small datasets consisting of a particular users' contextual assignment of meaning to terms. We present a method for processing a raw stream of cross-modal input---i.e., linguistic instructions, visual perception of a scene and a concurrent trace of 3D eye tracking fixations---to produce the segmentation of objects with a correspondent association to high-level concepts. To test our framework we present experiments in a table-top object manipulation scenario. Our results show our model learns the user's notion of colour and shape from a small number of physical demonstrations, generalising to identifying physical referents for novel combinations of the words.

Comments:	9 pages, 8 figures, To appear in the Proceedings of the ACL workshop Language Grounding for Robotics, Vancouver, Canada
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1706.00355 [cs.AI]
	(or arXiv:1706.00355v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1706.00355

Submission history

From: Yordan Hristov [view email]
[v1] Thu, 1 Jun 2017 15:42:50 UTC (3,040 KB)

Computer Science > Artificial Intelligence

Title:Grounding Symbols in Multi-Modal Instructions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Grounding Symbols in Multi-Modal Instructions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators