Abstract
Moving object segmentation from an image sequence is essential for a robot to interact with its environment. Traditional vision approaches appeal to pure motion analysis on videos without exploiting the source of the background motion. We observe, however, that the background motion (from the robot’s egocentric view) has stronger correlation to the robot’s motor signals than the foreground motion. We propose a novel approach to detecting moving objects by clustering features into background and foreground according to their motion consistency with motor signals. Specifically, our approach learns homography and fundamental matrices as functions of motor signals, and predict sparse feature locations from the learned matrices. The errors between the predictions and their actual tracked locations are used to label them into background and foreground. The labels of the sparse features are then propagated to all pixels. Our approach does not require building a dense mosaic background or searching for affine, homography, or fundamental matrix parameters for foreground separation. In addition, it does not need to explicitly model the intrinsic and extrinsic calibration parameters hence requires much less prior geometry knowledge. It works completely in 2D image space, and does not involve any complex analysis or computation in 3D space.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Wren, C., Azarbayejani, A., Darrell, T., Pentland, A.: Pfinder: Real-time tracking of the human body. PAMI (1997)
Stauffer, C., Grimson, W.: Learning patterns of activity using real-time tracking. PAMI (2000)
Elgammal, A., Harwood, D., Davis, L.: Non-parametric Model for Background Subtraction. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1843, pp. 751–767. Springer, Heidelberg (2000)
Mittal, A., Paragios, N.: Motion-based background subtraction using adaptive kernel density estimation. In: CVPR (2004)
Monnet, A., Mittal, A., Paragios, N., Ramesh, V.: Background modeling and subtraction of dynamic scenes. In: ICCV (2008)
Ko, T., Soatto, S., Estrin, D.: Warping background subtraction. In: CVPR (2010)
McKenna, S., Jabri, S., Duric, Z., Rosenfeld, A., Wechsler, H.: Tracking groups of people. In: CVIU (2000)
Hayman, E., Eklundh, J.: Statistical background subtraction for a mobile observer. In: ICCV (2003)
Mittal, A., Huttenlocher, D.: Scene modeling for wide area surveillance and image synthesis. In: CVPR (2000)
Brown, M., Lowe, D.: Recognising panoramas. In: ICCV (2003)
Szeliski, R.: Image alignment and stitching: A tutorial. In: Foundations and Trends in Computer Graphics and Vision (2006)
Ren, X., Gu, C.: Figure-ground segmentation improves handled object recognition in egocentric video. In: CVPR (2010)
Han, M., Xu, W., Gong, Y.: Video object segmentation by motion-based sequential feature clustering. In: ACM International Conference on Multimedia (2006)
Sivic, J., Schaffalitzky, F., Zisserman, A.: Object level grouping for video shots. IJCV (2006)
Xiao, J., Shah, M.: Motion layer extraction in the presence of occlusion using graph cuts. PAMI (2005)
Sheikh, Y., Javed, O., Kanade, T.: Background subtraction for freely moving cameras. In: ICCV (2009)
Lee, Y., Kim, J., Grauman, K.: Key-segments for video object segmentation. In: ICCV (2011)
Ren, X., Malik, J.: Tracking as repeated figure/ground segmentation. In: CVPR (2007)
Galletti, C., Fattori, P.: Neuronal mechanisms for detection of motion in the field of view. Neuropsychologia (2003)
Knight, J., Reid, I.: Automated alignment of robotic pan-tilt camera units using vision. IJCV (2006)
Hwangbo, M., Kim, J., Kanade, T.: Inertial-aided klt feature tracking for a moving camera. IROS (2009)
Isard, M., Blake, A.: CONDENSATION - conditional density propagation for visual tracking. IJCV (1998)
Kass, M., Witkin, A., Terzopoulos, D.: Snakes: Active contour models. IJCV (1988)
Hartley, R., Zisserman, A.: Multiple view geometry in computer vision. Cambridge University Press (2003)
Ma, Y.: An invitation to 3-D vision: From images to geometric models. Springer (2004)
Shi, J., Tomasi, C.: Good features to track. In: CVPR (1994)
Xu, C., Liu, J., Kuipers, B.: Motion segmentation by learning homography matrices from motor signals. In: CRV (2011)
Joachims, T.: Transductive learning via spectral graph partitioning. In: ICML (2003)
Dhillon, I.: Co-clustering documents and words using bipartite spectral graph partitioning. In: SIGKDD (2001)
Shi, J., Malik, J.: Normalized cuts and image segmentation. PAMI (2000)
Uemura, H., Ishikawa, S., Mikolajczyk, K.: Feature tracking and motion compensation for action recognition. In: BMVC (2008)
Goshen, L., Shimshoni, I.: Guided sampling via weak motion models and outlier sample generation for epipolar geometry estimation. IJCV (2008)
Gulshan, V., Rother, C., Criminisi, A., Blake, A., Zisserman, A.: Geodesic star convexity for interactive image segmentation. In: CVPR (2010)
Sivic, J., Russell, B., Zisserman, A., Freeman, W., Efros, A.: Unsupervised discovery of visual object class hierarchies. In: CVPR (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, C., Liu, J., Kuipers, B. (2012). Moving Object Segmentation Using Motor Signals. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7576. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33715-4_49
Download citation
DOI: https://doi.org/10.1007/978-3-642-33715-4_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33714-7
Online ISBN: 978-3-642-33715-4
eBook Packages: Computer ScienceComputer Science (R0)