A Comparison of Qualitative and Metric Spatial Relation Models for Scene Understanding

Authors

  • Akshaya Thippur KTH Royal Institute of Technology
  • Chris Burbridge University of Birmingham
  • Lars Kunze University of Birmingham
  • Marina Alberti KTH Royal Institute of Technology
  • John Folkesson KTH Royal Institute of Technology
  • Patric Jensfelt KTH Royal Institute of Technology
  • Nick Hawes University of Birmingham

DOI:

https://doi.org/10.1609/aaai.v29i1.9421

Keywords:

Spatial relations, Spatial contexts, Long-term Autonomy, Joint object classification

Abstract

Object recognition systems can be unreliable when run in isolation depending on only image based features, but their performance can be improved when taking scene context into account. In this paper, we present techniques to model and infer object labels in real scenes based on a variety of spatial relations — geometric features which capture how objects co-occur — and compare their efficacy in the context of augmenting perception based object classification in real-world table-top scenes. We utilise a long-term dataset of office table-tops for qualitatively comparing the performances of these techniques. On this dataset, we show that more intricate techniques, have a superior performance but do not generalise well on small training data. We also show that techniques using coarser information perform crudely but sufficiently well in standalone scenarios and generalise well on small training data. We conclude the paper, expanding on the insights we have gained through these comparisons and comment on a few fundamental topics with respect to long-term autonomous robots.

Downloads

Published

2015-02-18

How to Cite

Thippur, A., Burbridge, C., Kunze, L., Alberti, M., Folkesson, J., Jensfelt, P., & Hawes, N. (2015). A Comparison of Qualitative and Metric Spatial Relation Models for Scene Understanding. Proceedings of the AAAI Conference on Artificial Intelligence, 29(1). https://doi.org/10.1609/aaai.v29i1.9421

Issue

Section

AAAI Technical Track: Knowledge Representation and Reasoning